UBIQUITIN-LYTIC PEPTIDE FUSION GENE CONSTRUCTS, PROTEIN PRODUCTS
DERIVING THEREFROM, AND METHODS OF MAKING AND USING SAME

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of Application No.
08/231,730, filed April 20, 1994 in the names of Jesse M. Jaynes
and Gordon R. Julian, which in turn is a continuation of
Application No. 08/225,476, filed April 8, 1994 in the names of
Jesse M. Jaynes and Gordon R. Julian, which is in turn a
continuation of Application No. 08/148,491, filed November 8, 1993
and Application No. 08/148,889, filed November 8, 1993, both filed
in the name of Gordon R. Julian, which are in turn continuations
of Application No. 08/039,620, filed June 4, 1993 in the names of
Jesse M. Jaynes and Gordon R. Julian.

BACKGROUND OF THE INVENTION 

Field of the Invention

The present invention relates to ubiquitin-lytic peptide 
fusion gene constructs with enhanced stability and gene 
expression, ubiquitin-lytic peptide fusion protein products, and
methods of making and using the same.

Description of Related Art 

Naturally occurring lytic peptides play an important if not 
critical role as immunological agents in insects and have some, 
albeit secondary, defense functions in a range of other animals. 
The function of these peptides is to destroy prokaryotic and other
non-host cells by disrupting the cell membrane and promoting cell
lysis. Common features of these naturally occurring lytic
peptides include an overall basic charge, a small size (23-39
amino acid residues), and the ability to form amphipathic α-helices
or β-pleated sheets. Several types of lytic peptides have
been identified: cecropins (described in U.S. Patents 4,355,104
and 4,520,016 to Hultmark et al.), defensins, sarcotoxins,
melittin, and magainins (described in U.S. Patent No. 4,810,777 to
Zasloff). Each of these peptide types is distinguished by
sequence and secondary structure characteristics.

Several hypotheses have been suggested for the mechanism of
action of the lytic peptides: disruption of the membrane lipid
bilayer by the amphipathic α-helix portion of the lytic peptide;
lytic peptide formation of ion channels, which results in
osmotically induced cytolysis; lytic peptide promotion of protein
aggregation, which results in ion channel formation; and lytic
peptide-induced release of phospholipids. Whatever the mechanism
of lytic peptide-induced membrane damage, an ordered secondary
conformation such as an amphipathic α-helix and positive charge
density are features that appear to participate in the function of
the lytic peptides.
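
The shared features named above (small size and overall basic charge) can be checked mechanically for any candidate sequence. The following Python sketch is illustrative only and is not part of the original disclosure. It assumes a deliberately simple charge model in which Lys and Arg each contribute +1 and Asp and Glu each contribute -1, ignoring histidine and the chain termini; the function name is hypothetical. The example sequence is the one listed as SEQ ID NO. 4 in Table 1.

```python
# Illustrative sketch: length and approximate net charge of a peptide
# written in three-letter codes. The charge model is an assumption
# (Lys/Arg = +1, Asp/Glu = -1; His and the termini are ignored).

POSITIVE = {"Lys", "Arg"}
NEGATIVE = {"Asp", "Glu"}


def peptide_summary(three_letter_sequence: str) -> dict:
    """Return the residue count and approximate net charge."""
    residues = three_letter_sequence.split()
    charge = sum((r in POSITIVE) - (r in NEGATIVE) for r in residues)
    return {"length": len(residues), "net_charge": charge}


# SEQ ID NO. 4 as listed in Table 1
SEQ_ID_NO_4 = (
    "Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys "
    "Lys Ala Val Lys Lys Ala Val"
)

if __name__ == "__main__":
    print(peptide_summary(SEQ_ID_NO_4))
```

Under this simplified model the 23-residue example carries nine lysines and no acidic residues, which is consistent with the basic character described above.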

Active synthetic analogs of naturally occurring lytic
peptides have been produced and tested in vitro against a variety
of prokaryotic and eukaryotic cell types (see for example
Arrowood, M.J., et al., J. Protozool. 38: 161s [1991]; Jaynes,
J.M., et al., FASEB J. 2: 2878 [1988]), including: gram positive
and gram negative bacteria, fungi, yeast, protozoa, envelope
viruses, virus-infected eukaryotic cells, and neoplastic or
transformed mammalian cells. The results from these studies
indicate that many of the synthetic lytic peptide analogs have
similar or higher levels of lytic activity for many different
types of cells, compared to the naturally occurring forms. In
addition, the peptide concentration required to lyse microbial
pathogens such as protozoans, yeast, and bacteria does not lyse
normal mammalian cells. However, because previous work
demonstrates that absolute sequence is not important as long as
positive charge and amphipathy are preserved, the level of
activity for a given synthetic peptide is difficult to predict.

The specificity of the lytic action also depends upon the
concentration of the peptide and the type of membrane with which
it interacts. Jaynes, J.M. et al., Peptide Research 2: 157 (1989)
discuss the altered cytoskeletal characteristics of transformed or
neoplastic mammalian cells that make them susceptible to lysis by
the peptides. In these experiments, normal, human non-transformed
cells remained unaffected at a given peptide concentration while
transformed cells were lysed; however, when normal cells were
treated with the cytoskeletal inhibitors cytochalasin D or
colchicine, sensitivity to lysis increased. The experiments show
that the action of lytic peptides on normal mammalian cells is
limited. This resistance to lysis was most probably due to the
well-developed cytoskeletal network of normal cells. In contrast,
transformed cell lines, which have well-known cytoskeletal
deficiencies, were sensitive to lysis. Because of differences in
cellular sensitivity to lysis, lytic peptide concentration can be
manipulated to effect lysis of one cell type but not another at
the same locus.

Synthetic lytic peptide analogs can also act as agents of
eukaryotic cell proliferation. Peptides that promote lysis of
transformed cells will, at lower concentrations, promote cell
proliferation in some cell types. This stimulatory activity is
thought to depend on the channel-forming capability of the
peptides, which somehow stimulates nutrient uptake, calcium influx
or metabolite release, thereby stimulating cell proliferation (see
Jaynes, J.M., Drug News & Perspectives 3: 69 [1990]; and Reed,
W.A. et al., Molecular Reproduction and Development 31: 106
[1992]). Thus, at a given concentration, these peptides stimulate
or create channels that can be beneficial to the normal mammalian
cell in a benign environment where it is not important to exclude
toxic compounds.

The synthetic lytic peptide analogs typically contain as few
as 12 and as many as 40 amino acid residues. A phenylalanine
residue is often positioned at the amino terminus of the protein
to provide an aromatic moiety analogous to the tryptophan residue
located near the amino terminus of natural cecropins and a
UV-absorbing moiety with which to monitor the purification of the
synthetic peptide. The basis for the design of these lytic
peptide analogs is that a peptide of minimal length, having an
amphipathic α-helical structure or a β-pleated sheet motif, and
overall positive charge density effects lytic activity.

Plant disease is one of the leading causes of crop loss in
the world and is estimated to cause up to one third of total crop
loss worldwide; for example, in the potato, losses associated with
bacterial disease are as high as 25% of worldwide production.
Additionally, the cultivation of a few species of plants in a
concentrated area exacerbates the spread of disease. Recent
advances in genetic engineering have led to the development of
plants with disease resistant phenotypes based on the expression
of recombinant DNA molecules. Transgenic tobacco plants were
engineered with both a wound inducible PiII promoter and a
constitutive 35S promoter to express two lytic peptides (SHIVA-1
and SB-37) with bacteriolytic activity. The SHIVA-1 plant
demonstrated enhanced resistance to bacterial wilt caused by
infection by Pseudomonas solanacearum (Jaynes, J.M., et al., Plant
Science 89: 43 (1993); Destefano-Beltran, L., et al.,
Biotechnology in Plant Disease Control, pp. 175-189, Wiley-Liss
(1993)). Thus lytic peptides have valuable uses as
anti-phytopathogenic agents. However, chemical synthesis of these
lytic peptides is very expensive. Therefore, alternate, more
economical and efficient methods of synthesis are needed, such as
in vivo synthesis in host cells using recombinant DNA methods.

Recombinant DNA molecules are produced by sub-cloning genes
into plasmids using a bacterial host intermediate. In principle
this technique is straightforward. However, any sequence that
interferes with bacterial growth through replication or production
of products toxic to the bacteria, such as lytic peptides, is
difficult to clone. Often, host bacterial cells containing
mutated forms of the DNA sequences encoding toxic products will be
selected. These mutations can result in either decreased
expression or production of an inactive product. Bacteria will
even insert mutations that prevent expression of a potentially
toxic product in cloned genes controlled by a eukaryotic promoter
that is not active in prokaryotes. The effect of this selection
of mutated species leads to an inability to isolate sub-clones
containing a non-mutated gene of choice. Thus, some sub-cloned
genes are unstable in their bacterial hosts, although this
instability can only be shown empirically. The bacteriolytic
activity of the lytic peptides is an obstacle to the production of
stable recombinant DNA molecules that express the genes at high
levels.

For example, in an attempt to sub-clone into a standard
plasmid vector a gene coding for frog magainin, a natural lytic
peptide, bacterial transformants contained deletion mutations in
the magainin coding region. Another attempt was made to sub-clone
a synthetic lytic peptide (SEQ ID NO. 98) into a standard plasmid
vector (pUC19) containing the Cauliflower Mosaic virus 35S
promoter. The resulting transformants were screened by polymerase
chain reaction (PCR). However, out of 30 colonies, only 2
sub-clones gave faint positive signals. These two sub-clones were
sequenced. The sequence showed that one clone had a point
mutation that introduced a stop codon 3/4 of the way through the
lytic peptide, and the other clone had a point mutation that
changed the start codon from methionine to isoleucine. Both
mutations would prevent the biosynthesis of the protein. Four
more clones were analyzed, and of these four, one was sub-cloned
in the wrong orientation, and three others had mutations
introduced into the sequence. One of these sub-clones was
selected for further analysis, but it inhibited the growth of its
E. coli host. Thus, the production of recombinant DNA molecules
coding for lytic peptides is difficult due to the uncertainty in
obtaining the correct sub-clone.
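
The two inactivating point mutations described above (a premature stop codon and a start codon changed from methionine to isoleucine) are the kind of defect that can be flagged by translating a sequenced clone and comparing it with the intended coding sequence. The Python sketch below is illustrative only and is not a procedure taken from this disclosure; the function names and the short toy reading frame are hypothetical, and only the standard genetic code is assumed.

```python
# Illustrative sketch: flag a lost start codon or a premature stop
# codon in a sequenced clone. The toy sequences are hypothetical.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Standard genetic code, with codons enumerated in T, C, A, G order
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)
    )


def screen_clone(reference: str, clone: str) -> list[str]:
    """List problems in the clone relative to the intended sequence.

    The reference is assumed to end with exactly one stop codon.
    """
    problems = []
    ref_protein = translate(reference)
    clone_protein = translate(clone)
    if not clone_protein.startswith("M"):
        problems.append("start codon lost")
    stop = clone_protein.find("*")
    # Any stop before the reference's final codon is premature.
    if 0 <= stop < len(ref_protein) - 1:
        problems.append(f"premature stop at codon {stop + 1}")
    return problems


if __name__ == "__main__":
    intended = "ATGAAAGCTGTTAAATAA"      # hypothetical toy reading frame
    print(screen_clone(intended, "ATAAAAGCTGTTAAATAA"))  # ATG changed to ATA
    print(screen_clone(intended, "ATGAAAGCTGTTTAATAA"))  # AAA changed to TAA
```

A screen of this kind only detects the defect after sequencing; it does not address the underlying selection pressure against intact toxic genes.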

Ubiquitin is a small, highly conserved protein present in all
eukaryotes. Ubiquitins are encoded by gene families that are
characterized by two types of basic structures. Polyubiquitin
genes contain several direct repeats of ubiquitin, and
ubiquitin-ribosomal fusion genes encode a single ubiquitin unit fused to the
coding region for a small ribosomal associated protein. Both of
these gene types are translated as polyproteins and then are
processed by an endogenous ubiquitin hydrolase present in
eukaryotes to release multiple ubiquitin proteins or ubiquitin and
the ribosomal associated protein. A number of ubiquitin cDNAs or
genomic clones have been isolated, including plant ubiquitin cDNAs
and genomic clones from the potato (Garbarino, J. and Belknap, W.,
Plant Molecular Biology 24: 119 (1994); Garbarino, J. et al.,
Plant Molecular Biology 20: 235 (1992)).

U.S. Patents 5,093,242 and 5,132,213 to Bachmair et al. teach
the use of a ubiquitin cloning vector as a method of producing
specified protein amino-termini. A recombinant DNA molecule was
constructed with a protein coding gene fused at its amino terminus
to a ubiquitin coding gene. Due to translation as a polypeptide
and cleavage by hydrolases, a protein with any amino acid at the
amino terminus can be generated. The amino terminus can be used
to control the metabolic stability of the protein. However, the
metabolic stability of the protein is dependent on the resulting
amino acid at the amino-terminus, not the generation of a
translation polypeptide.

The foregoing facts suggest that although lytic peptides as a
class may include species that are efficacious in destroying
bacteria, neoplastic cells, fungi, virus-infected cells, and
protozoa, this lytic characteristic also decreases the stability
of sub-cloned lytic peptides in host cells. This decreased
stability hinders efforts to develop a more economical and
efficient means of synthesizing lytic peptides.

It would therefore be a significant advance in the art, and
is correspondingly an object of the present invention, to develop a
method of sub-cloning nucleotide sequences coding for lytic
peptides into expression vectors, providing gene constructs with
enhanced stability and gene expression and reduced toxicity.

SUMMARY OF THE INVENTION

The present invention relates generally to ubiquitin-lytic
peptide fusion nucleic acid expression vectors comprising a
promoter and ubiquitin polypeptide coding sequence ligated to a
lytic peptide, ubiquitin-lytic peptide fusion protein products,
and methods of making and using the same, as hereinafter more
fully described.

It is another object of the invention to provide
ubiquitin-lytic peptide fusion expression vectors and protein products
derived therefrom.

It is another object of the invention to provide
ubiquitin-lytic peptide fusion expression vectors that are expressed in
plants having utility for promoting wound healing and combatting
bacterial infections in plants.

It is a further object of this invention to provide
ubiquitin-lytic peptide fusion polypeptides having utility for
combatting protozoal infections, neoplasias, fungal infections,
viral infections, and bacterial infections in mammals and plants.

It is yet another object of this invention to develop a
method of sub-cloning polypeptide sequences in ubiquitin-fusion
expression vectors with enhanced stability and gene expression.

It is yet another object of this invention to provide
expression vectors containing constitutive and wound inducible
ubiquitin promoters that are expressed in eukaryotic cells.

It is yet another object of this invention to provide
expression vectors with prokaryotic promoters that express
ubiquitin-lytic peptide fusion genes in prokaryotic hosts, the
products of which can be cleaved in vitro by ubiquitin hydrolases.

These and other objects and advantages will be more fully
apparent from the ensuing disclosure and claims.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a map of a recombinant nucleic acid expression
vector pUCUbi3-LP98 containing a 520 bp ubiquitin-ribosomal fusion
gene promoter region linked to a 228 bp coding region for a
ubiquitin polypeptide with a six bp BamHI site at the 3' end (SEQ
ID NO. 93) that is fused at its 3' end to a gene coding for a
lytic peptide (D5D*, SEQ ID NO. 98). The Ubi3 ubiquitin-lytic
peptide nucleotide sequence corresponds to SEQ ID NO. 92. A
nopaline synthase polyadenylation signal is located at the 3' end
of the lytic peptide gene.

Figure 2 is a map of a recombinant nucleic acid expression
vector pUCUbi7-LP98 containing a 1220 bp polyubiquitin promoter
region and 568 bp intron linked to a 228 bp coding region for a
ubiquitin polypeptide with a six bp BamHI site at the 3' end (SEQ
ID NO. 96) that is fused at its 3' end to a gene coding for a
lytic peptide (D5D*, SEQ ID NO. 98). The Ubi7 ubiquitin-lytic
peptide nucleotide sequence corresponds to SEQ ID NO. 95. A
nopaline synthase polyadenylation signal is located at the 3' end
of the lytic peptide gene.

DETAILED DESCRIPTION OF THE INVENTION AND
PREFERRED EMBODIMENTS THEREOF 



The disclosures of prior co-pending U.S. Patent Application
No. 08/039,620 filed June 4, 1993 in the names of Jesse M. Jaynes
and Gordon R. Julian, U.S. Patent Application No. 08/148,889 filed
November 8, 1993 in the name of Gordon R. Julian, U.S. Patent
Application No. 08/148,491 filed November 8, 1993 in the name of
Gordon R. Julian, U.S. Patent Application No. 08/225,476 filed
April 8, 1994 in the names of Jesse M. Jaynes and Gordon R.
Julian, and U.S. Patent Application No. 08/231,730 filed April 20,
1994 in the names of Jesse M. Jaynes and Gordon R. Julian, are all
hereby incorporated herein by reference in their entirety.

The term "amphipathic" as used herein refers to the
distribution of hydrophobic and hydrophilic amino acid residues
along opposing faces of an α-helix structure or other secondary
conformation, which results in one face of the α-helix structure
being predominantly hydrophobic and the other face being
predominantly hydrophilic. The degree of amphipathy of a peptide
can be assessed by plotting the sequential amino acid residues on
an Edmundson helical wheel (see also Kamtekar, S. et al., Science
262: 1680 (1993)).
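
The helical wheel assessment described above can be approximated numerically. The Python sketch below is illustrative only: it places successive residues 100 degrees apart, as on an ideal α-helix with 3.6 residues per turn, and sums a per-residue hydrophobicity vector around the wheel. The two-valued hydrophobicity scale (+1 for the residues listed, -1 for all others) is a hypothetical simplification chosen for clarity, not a published scale, and the function names are assumptions.

```python
# Illustrative sketch: helical wheel positions and a mean hydrophobic
# moment for a peptide in one-letter code. The two-valued scale below
# is a hypothetical simplification, not a published scale.
import math

ANGLE_PER_RESIDUE = 100.0  # degrees; ideal alpha-helix, 3.6 residues/turn

HYDROPHOBIC = set("AVLIMFWC")  # assumed +1; every other residue is -1


def wheel_angles(sequence: str) -> list[float]:
    """Angular position in degrees (0 to 360) of each residue."""
    return [(i * ANGLE_PER_RESIDUE) % 360.0 for i in range(len(sequence))]


def hydrophobic_moment(sequence: str) -> float:
    """Length of the mean hydrophobicity vector around the wheel.

    Values near zero mean no sidedness; larger values mean the
    hydrophobic and hydrophilic residues sit on opposing faces.
    The sequence must be non-empty.
    """
    x = y = 0.0
    for residue, angle in zip(sequence, wheel_angles(sequence)):
        h = 1.0 if residue in HYDROPHOBIC else -1.0
        x += h * math.cos(math.radians(angle))
        y += h * math.sin(math.radians(angle))
    return math.hypot(x, y) / len(sequence)


if __name__ == "__main__":
    # SEQ ID NO. 4 from Table 1, written in one-letter code
    print(hydrophobic_moment("FAVAVKAVKKAVKKVKKAVKKAV"))
```

A uniform 18-residue sequence returns a moment of essentially zero, because 18 residues complete exactly five turns and the vectors cancel; a sequence with segregated faces returns a larger value.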

The terms "peptide" and "polypeptide" as used herein refer to
a molecule composed of a chain of amino acid residues and are
intended to be construed as inclusive of polypeptides and peptides
per se having molecular weights of up to 10,000 daltons, as well
as proteins having molecular weights of greater than about 10,000
daltons, wherein the molecular weights are number average
molecular weights. The terms are also intended to be construed as
inclusive of functional equivalents thereof when used in reference
to a specific peptide coding sequence in the specification and
claims herein. Functional equivalents of peptides and
polypeptides include but are not limited to deletions, additions,
and substitutions of amino acids in the polypeptide or peptide
chain that do not adversely affect the overall function of the
resulting peptide or polypeptide.

The term "plasmid" as used herein refers to a DNA molecule
that is capable of autonomous replication within a host cell,
either extrachromosomally or as part of the host cell
chromosome(s). The starting plasmids herein are commercially
available, are publicly available on an unrestricted basis, or can
be constructed from such available plasmids as disclosed herein
and/or in accordance with published procedures. In certain
instances, as will be apparent to the ordinarily skilled artisan,
other plasmids known in the art may be used interchangeably with
plasmids described herein.

The term "ligation" as used herein refers to the process of
forming phosphodiester bonds between two double-stranded DNA
fragments. Unless otherwise specified, ligation is accomplished
using standard procedures known to one skilled in the art.

The term "polymerase chain reaction," or "PCR," as used herein
refers to a method for amplification of a desired nucleotide
sequence in vitro, as described in U.S. Patent No. 4,683,195,
herein incorporated by reference in its entirety.

The term "nucleic acid" as used herein refers to
deoxyribonucleic acid molecules (DNA) composed of a chain of
deoxyribonucleotides and ribonucleic acid molecules (RNA) composed
of a chain of ribonucleotides. The term "nucleic acid" as used
herein is to be construed as including functional equivalents
thereof when used in reference to a specific nucleotide sequence
in the specification and claims herein. Functional equivalents of
nucleic acid molecules include synonymous coding sequences with
one or more codon substitutions and deletions or additions that do
not affect the overall function of the resulting nucleic acid
molecule. The degeneracy of the genetic code is well known to the
art; therefore, synonymous coding sequences with one or more codon
substitutions can be readily determined by one of ordinary skill
in the art. Synonymous nucleotide coding sequences vary from the
exemplified coding sequences but encode proteins of the same amino
acid sequences as those specifically provided herein or proteins
with similar function and are therefore also regarded as
functional equivalents thereof.
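
As an illustration of the degeneracy referred to above, the following Python sketch tests whether two different coding sequences encode the same amino acid sequence and lists the codons available for a given residue. It is not part of the original disclosure; the function names and the short example sequences are hypothetical, and only the standard genetic code is assumed.

```python
# Illustrative sketch: synonymous coding sequences under the standard
# genetic code. Example sequences are hypothetical.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Codons enumerated in T, C, A, G order at each position
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)
    )


def are_synonymous(dna_a: str, dna_b: str) -> bool:
    """True when both sequences encode the same amino acid sequence."""
    return translate(dna_a) == translate(dna_b)


def synonymous_codons(amino_acid: str) -> list[str]:
    """All codons that encode the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


if __name__ == "__main__":
    # Both sequences encode Lys-Ala-Val with different codons.
    print(are_synonymous("AAAGCTGTT", "AAGGCAGTC"))
    print(synonymous_codons("K"))
```

Because lysine has two codons and alanine and valine have four each, even a short lysine-rich peptide has many synonymous coding sequences.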

The term "promoter" as used herein refers to an untranslated
(i.e. one that does not result in a peptide or protein product)
sequence upstream of the polypeptide coding region of a nucleotide
sequence that controls transcription of a gene. Promoters
typically fall into two classes, constitutive and inducible.
Inducible promoters initiate high levels of transcription of the
nucleic acid under their control in response to external stimuli.
Constitutive promoters maintain a relatively constant level of
transcription in a given cell. Suitable promoters for use in the
present invention may include both prokaryotic and eukaryotic promoters,
with all ubiquitin promoters being preferred, solanaceous plant
ubiquitin promoters being highly preferred, and potato ubiquitin
promoters being most preferred. Additional control sequences such
as ribosomal binding sites and enhancers may be included as
control sequences when necessary.

The term "polyadenylation site" as used herein refers to a
control sequence located on the 3' end of a gene construct that
provides a signal for cleavage and polyadenylation of the
transcription unit expressed from the promoter. These control
sequences are known to one skilled in the art.

The term "expression" as used herein refers to transcription
and/or translation of a nucleic acid sequence coding for a
protein or peptide.

In one embodiment, the present invention is directed to an
isolated nucleotide sequence comprising a gene coding for a
ubiquitin polypeptide and functional equivalents thereof, linked
to a ubiquitin promoter and functional equivalents thereof.
Suitable ubiquitin promoters for use in the present invention
include, but are not limited to, ubiquitin promoters from
solanaceous plants. Preferably, the ubiquitin promoter is a
potato plant ubiquitin promoter and most preferably it is the
potato Ubi3 or Ubi7 promoter. In embodiments wherein the isolated
nucleotide sequence codes for the potato Ubi3 promoter linked to a
gene coding for a ubiquitin polypeptide it has a nucleotide
sequence according to SEQ ID NO. 93. The Ubi3 promoter alone also
has utility as a constitutive promoter in eukaryotes.

In embodiments wherein the isolated nucleotide sequence codes
for the potato Ubi7 promoter linked to a gene coding for a
ubiquitin polypeptide it has a nucleotide sequence according to
SEQ ID NO. 96. The Ubi7 nucleotide sequence according to SEQ ID
NO. 96 includes an intron that is part of the ubiquitin
transcription unit. The intron is not required for gene
expression from the Ubi7 promoter, thus the Ubi7 promoter region
without the intron can be considered as a specific functional
equivalent of the Ubi7 promoter. The Ubi7 promoter alone, with
or without the intron, has utility as a wound inducible promoter
in eukaryotes.

Preferably, the nucleotide sequence comprising the isolated
ubiquitin promoter and gene coding for a ubiquitin polypeptide
further comprises a gene coding for a lytic peptide ligated to the
3' end of the gene coding for a ubiquitin polypeptide. Suitable
genes coding for a lytic peptide have a nucleotide sequence coding
for any one of the amino acid sequences according to SEQ ID NO.
1-91 and 97-98.

In one preferred embodiment, the present invention is
directed to an isolated nucleotide sequence comprising a gene
coding for a lytic peptide ligated to the 3' end of the gene
coding for a ubiquitin polypeptide linked to the Ubi3 ubiquitin
promoter having a nucleotide sequence according to SEQ ID NO. 92.
In an alternative of this embodiment, the present invention is
directed to an isolated nucleotide sequence comprising a gene
coding for a lytic peptide ligated to the 3' end of the gene
coding for a ubiquitin polypeptide linked to a Ubi7 ubiquitin
promoter having a nucleotide sequence according to SEQ ID NO. 95.

In another embodiment, the present invention is directed to a
recombinant nucleic acid expression vector. The vector is
characterized in that it comprises a nucleotide sequence wherein a
gene coding for a ubiquitin polypeptide is linked to a ubiquitin
promoter. Preferably, the present invention is directed to a
recombinant nucleic acid expression vector characterized in that
it further comprises a nucleotide sequence wherein a gene coding
for a lytic peptide is ligated to the 3' end of the gene coding
for a ubiquitin polypeptide linked to a ubiquitin promoter.
Suitable vectors for use in this invention include any eukaryotic
or prokaryotic expression vectors known in the art. Preferable
vectors for use in this invention are pUC19 and pCGN1547.

In another embodiment, the present invention is directed to a
host cell that is transformed by a recombinant DNA expression
vector comprising a gene coding for a ubiquitin polypeptide linked
to a ubiquitin promoter. Suitable host cells for transformation in
the present invention include all known bacterial host cells, with
all strains of Escherichia coli and Agrobacterium tumefaciens
being preferred. Preferably, the present invention is directed
to a host cell wherein the recombinant DNA expression vector further
comprises a gene coding for a lytic peptide ligated to the 3' end
of the gene coding for a ubiquitin polypeptide linked to a
ubiquitin promoter. Suitable genes coding for a lytic peptide
have a nucleotide sequence coding for any one of the amino acid
sequences according to SEQ ID NO. 1-91 and 97-98.

Preferably, the present invention is directed to a
solanaceous plant host cell that is transformed by a recombinant
DNA expression vector. Most preferably the solanaceous plant cell
is a potato plant host cell.

In another embodiment, the present invention is directed to
an isolated nucleotide sequence and functional equivalents thereof
coding for a lytic peptide, where the nucleotide sequence has a
sequence coding for any one of the amino acid sequences according
to SEQ ID NO. 1-91 and 97-98.

In yet another embodiment, the present invention is directed
to a purified ubiquitin polypeptide and functional equivalents
thereof having an amino acid sequence according to SEQ ID NO. 94.
This embodiment can further comprise a lytic peptide
translationally fused to the carboxy terminus of a ubiquitin
polypeptide.

In another embodiment, the present invention is directed to a
method of sub-cloning nucleotide sequences coding for lytic
peptides and expressing such sequences in cells. The method
comprises a first step wherein a recombinant nucleic acid
containing a gene coding for a lytic peptide ligated to a gene
coding for a ubiquitin polypeptide linked to a ubiquitin promoter
is produced in a first host cell. Suitable first host cells
include any known bacterial host cells. Preferably, the first
host cell is either an Escherichia coli cell or an Agrobacterium
tumefaciens cell.

If the peptides are sub-cloned using such a ubiquitin-fusion
expression vector, the following advantage results: the lytic
peptide gene constructs have increased stability in the bacterial
host. While not wishing to be bound by any one theory, the
present inventors believe that the stability is due to the
ubiquitin protein coding nucleic acid region fused to the 5' end
of the lytic peptide nucleic acid sequence. Bacteria do not
contain the endogenous hydrolase necessary for cleavage of the
ubiquitin fusion protein, so the gene constructs are not toxic to
bacteria, since active lytic peptide cannot be released. Thus
functional equivalents of the ubiquitin fusion polypeptide include
any ubiquitin molecule that is capable of deceiving the host cell
into viewing the gene construct and its products as non-toxic.

In a variation of this embodiment, the recombinant nucleic
acid vector is isolated from the first host cell and expressed in
a second host cell. Suitable second host cells are plant and
animal cells, preferably a solanaceous plant cell, and most
preferably a potato plant cell. In the second host cell the
fusion gene is expressed at high levels and the polyprotein is
cleaved by endogenous ubiquitin hydrolases to produce active lytic
peptide. These transgenic hosts provide from the expression
vector lytic peptides in vivo to combat bacterial infections,
fungal infections, protozoal infections, virus infections, and
neoplasias. In addition, expression vectors containing ubiquitin
promoters that are either constitutive or wound inducible are used
to express peptides in eukaryotes.
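
The two-host rationale described above can be summarized as a small model. The Python sketch below is a conceptual illustration only, not a simulation of any real enzyme: it treats the fusion polypeptide as an ubiquitin portion followed by a lytic peptide portion, leaves it intact in a host assumed to lack ubiquitin hydrolase, and splits it at the junction in a host assumed to have the hydrolase. The class and function names and the placeholder ubiquitin string are hypothetical; the lytic peptide shown is SEQ ID NO. 4 from Table 1 in one-letter code.

```python
# Conceptual sketch of the fusion strategy: the lytic peptide is only
# released in a host that has ubiquitin hydrolase. Names and the
# placeholder ubiquitin string are hypothetical.
from dataclasses import dataclass


@dataclass(frozen=True)
class FusionProtein:
    ubiquitin: str       # N-terminal ubiquitin portion (placeholder text)
    lytic_peptide: str   # C-terminal lytic peptide portion

    @property
    def sequence(self) -> str:
        """Full-length, uncleaved fusion polypeptide."""
        return self.ubiquitin + self.lytic_peptide


def process_in_host(fusion: FusionProtein,
                    has_ubiquitin_hydrolase: bool) -> list[str]:
    """Return the polypeptide products expected in the given host."""
    if has_ubiquitin_hydrolase:
        # Eukaryotic host: cleavage at the ubiquitin/peptide junction
        return [fusion.ubiquitin, fusion.lytic_peptide]
    # Bacterial host: no cleavage, so no free lytic peptide
    return [fusion.sequence]


if __name__ == "__main__":
    fusion = FusionProtein("[ubiquitin]", "FAVAVKAVKKAVKKVKKAVKKAV")
    print(process_in_host(fusion, has_ubiquitin_hydrolase=False))
    print(process_in_host(fusion, has_ubiquitin_hydrolase=True))
```

The model makes the design choice explicit: toxicity to the cloning host is avoided because the free lytic peptide never appears among the products when the hydrolase is absent.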

The present invention is also directed to a method of
sub-cloning nucleotide sequences coding for lytic peptides and
expressing such sequences in cells. The method comprises producing
in a host cell a recombinant nucleic acid expression vector
comprising a gene coding for a lytic peptide ligated to the 3' end
of a gene coding for a ubiquitin polypeptide linked to a prokaryotic
promoter sequence. Suitable prokaryotic promoters include those
known to one skilled in the art to be active in prokaryotes and
used in plasmid vectors for bacterial gene expression.

The recombinant nucleic acid expression vector is expressed
in the host cell and ubiquitin-lytic peptide fusion polypeptides
are isolated from the host. Preferably, the host cell is either
an Escherichia coli cell or an Agrobacterium tumefaciens cell.
The isolated ubiquitin-lytic peptide fusion polypeptides are then
cleaved in vitro by ubiquitin hydrolases to release the lytic
peptides from the ubiquitin polypeptide (see U.S. Patent No.
5,196,321 to Bachmair et al.). The active lytic peptides are then
used to treat bacterial infections, fungal infections, protozoal
infections, virus infections, and neoplasias. These isolated
lytic peptides are in some instances glyoxylated or methylated in
vitro to stabilize against proteolytic digestion in vivo.

Ubiquitin fusion expression vectors thus have broad utility
as cloning and expression vectors to stabilize and sub-clone lytic
peptide nucleotide sequences, as well as a wide variety of
protein coding nucleic acid sequences that are otherwise toxic to
their hosts. The ubiquitin-lytic peptide expression vectors also
have broad utility as an economical and efficient means to
synthesize lytic peptides in host cells. These lytic peptides
have utility for combatting protozoal infections, neoplasias,
fungal infections, viral infections, and bacterial infections in
mammals and plants.






The features and advantages of the invention are more fully 
shown by the following illustrative examples and embodiments, 
which are not to be limitingly construed as regard the broad 
scope, utility, and applicability of the invention. 






Example 1 



Representative Lytic Peptides and Ubiquitin Polypeptide

Set out in Table 1 below as illustrative examples of lytic 
peptides are the amino acid sequences of families of related lytic 
peptides. These lytic peptides are designated for ease of 
reference as SEQ ID NO. 1-91 and 97-98. Nucleic acid sequences 
coding for these lytic peptides and functional equivalents thereof 
represent examples of lytic peptide nucleic acid sequences that 
are sub-cloned to make ubiquitin-lytic peptide fusion gene 
constructs and polypeptides. The ubiquitin polypeptide, 
designated for ease of reference as SEQ ID NO. 94, and functional 
equivalents thereof, represents an example of the 5' fusion 
ubiquitin polypeptide. 
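
The three-letter listings in Table 1 can be cross-checked mechanically. The following is a minimal Python sketch, not part of the original disclosure: it converts a three-letter listing to one-letter code and estimates net charge by counting Lys/Arg against Asp/Glu. The helper names and the simplified charge rule (which ignores His and the peptide termini) are assumptions made for illustration. SEQ ID NO. 4, as listed in Table 1, is used as input.

```python
# Illustrative helpers; the names are hypothetical and not taken from the specification.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert a whitespace-separated three-letter listing to one-letter code."""
    return "".join(THREE_TO_ONE[residue] for residue in three_letter.split())


def approximate_net_charge(one_letter: str) -> int:
    """Count basic (Lys, Arg) minus acidic (Asp, Glu) residues.

    Simplifying assumption: His and the N- and C-termini are ignored.
    """
    basic = sum(one_letter.count(r) for r in "KR")
    acidic = sum(one_letter.count(r) for r in "DE")
    return basic - acidic


# SEQ ID NO. 4 as listed in Table 1.
SEQ_ID_NO_4 = (
    "Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys "
    "Lys Ala Val Lys Lys Ala Val"
)

peptide = to_one_letter(SEQ_ID_NO_4)
print(peptide, len(peptide), approximate_net_charge(peptide))
```

Under these assumptions the peptide carries a clearly positive approximate charge, in keeping with the overall basic charge described for lytic peptides in the Background section.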

TABLE 1: LYTIC PEPTIDE SEQUENCES 



SEQ ID NO. 1 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
1               5                   10                  15 

Lys Ala Val Lys Lys Ala Val Lys Lys Lys Lys 
            20                  25 


SEQ ID NO. 2 

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Lys Lys Ala 
1               5                   10                  15 

Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala Val Lys Lys Lys Lys 
            20                  25                  30 

SEQ ID NO. 3 

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Ala Val Lys 
1               5                   10                  15 

Ala Val Lys Lys Ala Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala 
            20                  25                  30 

Val Lys Lys Lys Lys 
        35 



SEQ ID NO. 4 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
1               5                   10                  15 

Lys Ala Val Lys Lys Ala Val 
            20 

SEQ ID NO. 5 

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Lys Lys Ala 
1               5                   10                  15 

Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala Val 
            20                  25 

SEQ ID NO. 6 

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Ala Val Lys 
1               5                   10                  15 

Ala Val Lys Lys Ala Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala 
            20                  25                  30 

Val 

SEQ ID NO. 7 

Phe Ala Val Gly Leu Arg Ala Ile Lys Arg Ala Leu Lys Lys Leu Arg 
1               5                   10                  15 

Arg Gly Val Arg Lys Val Ala Lys Arg Lys Arg 
            20                  25 

SEQ ID NO. 8 

Phe Ala Val Gly Leu Arg Ala Ile Lys Arg Ala Leu Lys Lys Leu Arg 
1               5                   10                  15 

Arg Gly Val Arg Lys Val Ala 
            20 

SEQ ID NO. 9 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
1               5                   10                  15 

Ala Arg Lys Ile Ala Arg Leu Gly Val Ala Phe 
            20                  25 

SEQ ID NO. 10 

Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Ala Arg Lys Ile 
1               5                   10                  15 

Ala Arg Leu Gly Val Ala Phe 
            20 

SEQ ID NO. 11 

Phe Ala Val Gly Leu Arg Ala Ile Lys Arg Ala Leu Lys Lys Leu Arg 
1               5                   10                  15 

Arg Gly Val Arg Lys Val Ala Lys Arg Lys Arg Lys Lys Asp Leu 
            20                  25                  30 

SEQ ID NO. 12 

Phe Ala Val Gly Leu Arg Ala Ile Lys Arg Ala Leu Lys Lys Leu Arg 
1               5                   10                  15 

Arg Gly Val Arg Lys Val Ala Lys Asp Leu 
            20                  25 

SEQ ID NO. 13 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
1               5                   10                  15 

Ala Arg Lys Ile Ala Arg Leu Gly Val Ala Phe Lys Asp Leu 
            20                  25                  30 

SEQ ID NO. 14 

Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Ala Arg Lys Ile 
1               5                   10                  15 

Ala Arg Leu Gly Val Ala Phe Lys Asp Leu 
            20                  25 

SEQ ID NO. 15 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1               5                   10                  15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
            20                  25 

SEQ ID NO. 16 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1               5                   10                  15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Val 
            20                  25                  30 

SEQ ID NO. 17 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1               5                   10                  15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Lys 
            20                  25                  30 

Val Ala Val Ala Val 
        35 

SEQ ID NO. 18 

Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val Ala Lys Lys Val 
1               5                   10                  15 

Ala Lys Val Ala Val Ala Val 
            20 

SEQ ID NO. 19 

Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val Ala Lys Lys Val 
1               5                   10                  15 

Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Val 
            20                  25 

SEQ ID NO. 20 

Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val Ala Lys Lys Val 
1               5                   10                  15 

Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala 
            20                  25                  30 

Val 



SEQ ID NO. 21 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 
1               5                   10                  15 

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val 
            20                  25 

SEQ ID NO. 22 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 
1               5                   10                  15 

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 
            20                  25                  30 

SEQ ID NO. 23 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 
1               5                   10                  15 

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 
            20                  25                  30 

Lys Val Ala Lys Lys 
        35 

SEQ ID NO. 24 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1               5                   10                  15 

Lys Lys Val Ala Lys Lys Val 
            20 

SEQ ID NO. 25 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1               5                   10                  15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 
            20                  25 

SEQ ID NO. 26 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1               5                   10                  15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Val Ala Lys 
            20                  25                  30 

Lys 

SEQ ID NO. 27 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1               5                   10                  15 

Lys Lys Val Ala Lys Lys Val Lys Lys Lys Lys 
            20                  25 

SEQ ID NO. 28 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1               5                   10                  15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Lys Lys 
            20                  25                  30 

SEQ ID NO. 29 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1               5                   10                  15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Val Ala Lys 
            20                  25                  30 

Lys Lys Lys Lys Lys 
        35 

SEQ ID NO. 30 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Lys Lys Lys Lys 
1               5                   10                  15 




WO 96/03522 



PCT/US95/09338 



SEQ ID NO. 31 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1               5                   10                  15 

Ala Lys Lys Lys Lys 
            20 

SEQ ID NO. 32 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1               5                   10                  15 

Ala Lys Val Lys Ala Lys Val Lys Lys Lys Lys 
            20                  25 

SEQ ID NO. 33 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1               5                   10 

SEQ ID NO. 34 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1               5                   10                  15 

Ala 

SEQ ID NO. 35 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1               5                   10                  15 

Ala Lys Val Lys Ala Lys Val 
            20 

SEQ ID NO. 36 

Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1               5                   10                  15 

SEQ ID NO. 37 

Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1               5                   10                  15 

Ala Lys Val Lys Ala 
            20 

SEQ ID NO. 38 

Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1               5                   10                  15 

Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val 
            20                  25 



SEQ ID NO. 39 

Phe Lys Lys Val Lys Lys Val Ala Lys Lys Val Cys Lys Cys Val Lys 
1               5                   10                  15 

Lys Ala Val Lys Lys Val Lys Lys Phe 
            20                  25 

SEQ ID NO. 40 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
1               5                   10                  15 

Lys Ala Val Lys Lys Ala Val Cys Cys Cys Cys 
            20                  25 

SEQ ID NO. 41 

Cys Cys Cys Cys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1               5                   10                  15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
            20                  25 

SEQ ID NO. 42 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
1               5                   10                  15 

Lys Ala Val Lys Lys Ala Val Ser Ser Ser Ser 
            20                  25 

SEQ ID NO. 43 

Ser Ser Ser Ser Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1               5                   10                  15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
            20                  25 

SEQ ID NO. 44 

Phe Ala Leu Ala Leu Lys Ala Leu Lys Lys Ala Leu Lys Lys Leu Lys 
1               5                   10                  15 

Lys Ala Leu Lys Lys Ala Leu 
            20 



SEQ ID NO. 45 

Leu Ala Lys Lys Leu Ala Lys Lys Leu Lys Lys Leu Ala Lys Lys Leu 
1               5                   10                  15 

Ala Lys Leu Ala Leu Ala Phe 
            20 

SEQ ID NO. 46 

Phe Ala Phe Ala Phe Lys Ala Phe Lys Lys Ala Phe Lys Lys Phe Lys 
1               5                   10                  15 

Lys Ala Phe Lys Lys Ala Phe 
            20 

SEQ ID NO. 47 

Phe Ala Ile Ala Ile Lys Ala Ile Lys Lys Ala Ile Lys Lys Ile Lys 
1               5                   10                  15 

Lys Ala Ile Lys Lys Ala Ile 
            20 

SEQ ID NO. 48 

Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe Ala Lys Lys Phe 
1               5                   10                  15 

Ala Lys Phe Ala Phe Ala Phe 
            20 

SEQ ID NO. 49 

Phe Lys Arg Leu Ala Lys Ile Lys Val Leu Arg Leu Ala Lys Ile Lys 
1               5                   10                  15 

Arg 

SEQ ID NO. 50 

Lys Leu Lys Leu Ala Val Lys Leu Val Gly Leu Leu Arg Lys Lys Arg 
1               5                   10                  15 

Ala Leu Lys Ile Ala Leu Arg Gly Val Ala Lys Arg Ala Gly Arg Leu 
            20                  25                  30 

Ala Val Arg Lys Phe 
        35 



SEQ ID NO. 51 

Phe Ala Arg Ala Arg Lys Ala Arg Lys Lys Ala Arg Lys Lys Arg Lys 
1               5                   10                  15 

Lys Ala Arg Lys Lys Ala Arg Lys Asp Arg 
            20                  25 

SEQ ID NO. 52 

Phe Ala Val Ala Val Cys Ala Val Cys Cys Ala Val Cys Cys Val Cys 
1               5                   10                  15 

Cys Ala Val Cys Cys Ala Val 
            20 

SEQ ID NO. 53 

Phe Ala Val Ala Val Ser Ala Val Ser Ser Ala Val Ser Ser Val Ser 
1               5                   10                  15 

Ser Ala Val Ser Ser Ala Val 
            20 

SEQ ID NO. 54 

Phe Ala Val Ala Val Ser Ala Val Ser Ser Ala Val Ser Ser Val Ser 
1               5                   10                  15 

Ser Ala Val Ser Ser Ala Val Ser Ser Ser Ser 
            20                  25 

SEQ ID NO. 55 

Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe Ala Lys Lys Phe 
1               5                   10                  15 

Ala Lys Phe Ala Phe Ala Phe Lys Lys Lys Lys 
            20                  25 

SEQ ID NO. 56 

Lys Lys Lys Lys Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe 
1               5                   10                  15 

Ala Lys Lys Phe Ala Lys Phe Ala Phe Ala Phe 
            20                  25 



SEQ ID NO. 57 

Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe Val Arg Lys Phe 
1               5                   10                  15 

Ile Arg Phe Ala Phe Leu Phe 
            20 

SEQ ID NO. 58 

Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe Val Arg Lys Phe 
1               5                   10                  15 

Ile Arg Phe Ala Phe Leu Phe Lys Arg Lys Arg 
            20                  25 

SEQ ID NO. 59 

Lys Arg Lys Arg Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe 
1               5                   10                  15 

Val Arg Lys Phe Ile Arg Phe Ala Phe Leu Phe 
            20                  25 

SEQ ID NO. 60 

Ile Ala Lys Lys Ile Ala Lys Lys Ile Lys Lys Ile Ala Lys Lys Ile 
1               5                   10                  15 

Ala Lys Ile Ala Ile Ala Ile 
            20 

SEQ ID NO. 61 

Ile Ala Lys Lys Ile Ala Lys Lys Ile Lys Lys Ile Ala Lys Lys Ile 
1               5                   10                  15 

Ala Lys Ile Ala Ile Ala Ile Lys Lys Lys Lys 
            20                  25 

SEQ ID NO. 62 

Lys Lys Lys Lys Ile Ala Lys Lys Ile Ala Lys Lys Ile Lys Lys Ile 
1               5                   10                  15 

Ala Lys Lys Ile Ala Lys Ile Ala Ile Ala Ile 
            20                  25 



SEQ ID NO. 63 

Ile Ala Arg Lys Ile Leu Lys Arg Ile Lys Lys Ile Val Arg Lys Phe 
1               5                   10                  15 

Ile Arg Ile Ala Ile Leu Ile 
            20 

SEQ ID NO. 64 

Ile Ala Arg Lys Ile Leu Lys Arg Ile Lys Lys Ile Val Arg Lys Phe 
1               5                   10                  15 

Ile Arg Ile Ala Ile Leu Ile Lys Arg Lys Arg 
            20                  25 

SEQ ID NO. 65 

Lys Arg Lys Arg Ile Ala Arg Lys Ile Leu Lys Arg Ile Lys Lys Ile 
1               5                   10                  15 

Val Arg Lys Phe Ile Arg Ile Ala Ile Leu Ile 
            20                  25 

SEQ ID NO. 66 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys 
1               5                   10                  15 

Leu 

SEQ ID NO. 67 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg 
1               5                   10                  15 

Ala Lys Ile Lys Leu 
            20 



SEQ ID NO. 68 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys 
1               5                   10                  15 

Leu Lys Arg Lys Arg 
            20 

SEQ ID NO. 69 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys 
1               5                   10                  15 

Leu Arg Val Lys Leu Lys Ile 
            20 

SEQ ID NO. 70 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys 
1               5                   10                  15 

Leu Arg Val Lys Leu Lys Ile Lys Arg Lys Arg 
            20                  25 

SEQ ID NO. 71 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg 
1               5                   10                  15 

Ala Lys Ile Lys Leu Arg Val Lys Leu Lys Ile 
            20                  25 

SEQ ID NO. 72 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys 
1               5                   10                  15 

Leu Arg Val Lys Leu Lys Ile Arg Ala Arg Ile Lys Leu 
            20                  25 

SEQ ID NO. 73 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys 
1               5                   10                  15 

Leu Arg Val Lys Leu Lys Ile Arg Ala Arg Ile Lys Leu Lys Arg Lys 
            20                  25                  30 

Arg 

SEQ ID NO. 74 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg 
1               5                   10                  15 

Ala Lys Ile Lys Leu Arg Val Lys Leu Lys Ile Arg Ala Arg Ile Lys 
            20                  25                  30 

Leu 

SEQ ID NO. 75 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys 
1               5                   10                  15 

Leu Val Phe Ala Ile Leu Leu 
            20 

SEQ ID NO. 76 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys 
1               5                   10                  15 

Leu Val Phe Ala Ile Leu Leu Lys Arg Lys Arg 
            20                  25 

SEQ ID NO. 77 

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg 
1               5                   10                  15 

Ala Lys Ile Lys Leu Val Phe Ala Ile Leu Leu 
            20                  25 

SEQ ID NO. 78 

Val Phe Ala Ile Leu Leu Phe Lys Leu Arg Ala Lys Ile Lys Val Arg 
1               5                   10                  15 

Leu Arg Ala Lys Ile Lys Leu 
            20 

SEQ ID NO. 79 

Val Phe Ala Ile Leu Leu Phe Lys Leu Arg Ala Lys Ile Lys Val Arg 
1               5                   10                  15 

Leu Arg Ala Lys Ile Lys Leu Lys Arg Lys Arg 
            20                  25 

SEQ ID NO. 80 

Lys Arg Lys Arg Val Phe Ala Ile Leu Leu Phe Lys Leu Arg Ala Lys 
1               5                   10                  15 

Ile Lys Val Arg Leu Arg Ala Lys Ile Lys Leu 
            20                  25 

SEQ ID NO. 81 

Val Gly Glu Cys Val Arg Gly Arg Cys Pro Ser Gly Met Cys Cys Ser 
1               5                   10                  15 

Gln Phe Gly Tyr Cys Gly Lys Gly Pro Lys Tyr Cys Gly 
            20                  25 

SEQ ID NO. 82 

Val Gly Glu Cys Val Arg Gly Arg Cys Pro Ser Gly Met Cys Cys Ser 
1               5                   10                  15 

Gln Phe Gly Tyr Cys Gly Lys Gly Pro Lys Tyr Cys Gly Arg 
            20                  25                  30 

SEQ ID NO. 83 

Leu Gly Asp Cys Leu Lys Gly Lys Cys Pro Ser Gly Met Cys Cys Ser 
1               5                   10                  15 

Asn Tyr Gly Phe Cys Gly Arg Gly Pro Arg Phe Cys Gly Lys 
            20                  25                  30 

SEQ ID NO. 84 

Gln Cys Ile Gly Asn Gly Gly Arg Cys Asn Glu Asn Val Gly Pro Pro 
1               5                   10                  15 

Tyr Cys Cys Ser Gly Phe Cys Leu Arg Gln Pro Gly Gln Gly Tyr Gly 
            20                  25                  30 

Tyr Cys Lys Asn Arg 
        35 

SEQ ID NO. 85 

Cys Ile Gly Asn Gly Gly Arg Cys Asn Glu Asn Val Gly Pro Pro Tyr 
1               5                   10                  15 

Cys Cys Ser Gly Phe Cys Leu Arg Gln Pro Asn Gln Gly Tyr Gly Val 
            20                  25                  30 

Cys Arg Asn Arg 
        35 

SEQ ID NO. 86 

Cys Ile Gly Gln Gly Gly Lys Cys Gln Asp Gln Leu Gly Pro Pro Phe 
1               5                   10                  15 

Cys Cys Ser Gly Tyr Cys Val Lys Asn Pro Gln Asn Gly Phe Gly Leu 
            20                  25                  30 

Cys Lys Gln Lys 
        35 

SEQ ID NO. 87 

Gln Lys Leu Cys Glu Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly 
1               5                   10                  15 

Asn Asn Asn Ala Cys Lys Asn Gln Cys Ile Asn Leu Glu Lys Ala Arg 
            20                  25                  30 

His Gly Ser Cys Asn Tyr Val Phe Pro Ala His Lys 
        35                  40 

SEQ ID NO. 88 

Gln Arg Val Cys Asp Lys Pro Ser Gly Thr Trp Ser Gly Leu Cys Gly 
1               5                   10                  15 

Asn Asn Asn Ala Cys Arg Gln Asn Cys Ile Gln Val Asp Arg Ala Lys 
            20                  25                  30 

Lys Gly Ser Cys Gln Phe Leu Tyr Pro Ala Lys Lys 
        35                  40 

SEQ ID NO. 89 

Gln Lys Leu Cys Gln Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly 
1               5                   10                  15 

Asn Asn Asn Ala Cys Lys Asn Gln Cys Ile Arg Leu Glu Lys Ala Arg 
            20                  25                  30 

His Gly Ser Cys 
        35 

SEQ ID NO. 90 

Gln Arg Val Cys Asn Lys Pro Ser Gly Thr Trp Ser Gly Leu Cys Gly 
1               5                   10                  15 

Asn Asn Asn Ala Cys Arg Gln Asn Cys Ile Lys Val Asp Arg Ala Lys 
            20                  25                  30 

Lys Gly Ser Cys 
        35 

SEQ ID NO. 91 

Met Leu Glu Glu Leu Phe Glu Glu Met Thr Glu Phe Ile Glu Glu Val 
1               5                   10                  15 

Ile Glu Thr Met 
            20 



SEQ ID NO. 94 

Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu 
1               5                   10                  15 

Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp 
            20                  25                  30 

Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys 
        35                  40                  45 

Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu 
    50                  55                  60 

Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gly Ser 
65                  70                  75 

SEQ ID NO. 97 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
1               5                   10                  15 

Ala Arg Lys Ile Ala Arg Leu Gly Val Ala Lys Leu Ala Gly Leu Arg 
            20                  25                  30 

Ala Val Leu Lys Phe 
        35 

SEQ ID NO. 98 

Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Asp Arg Lys Ile 
1               5                   10                  15 

Asp Arg Leu Gly Val Asp Phe 
            20 


Example 2 

Construction of Ubiquitin-lytic Peptide Fusion Plasmids With 
Ubiquitin-ribosomal Fusion Gene Promoter (Ubi3) 

Exemplary and preferred pUC19 and pCGN1547 plasmid vectors 
containing a potato (Solanum tuberosum) ubiquitin-ribosomal fusion 
promoter (Ubi3), a region coding for a ubiquitin polypeptide, and 
a gene coding for a lytic peptide are constructed. 

To obtain the genomic clone containing a ubiquitin-ribosomal 
fusion promoter and ubiquitin polypeptide coding region, a λFIXII 
potato genomic library is first prescreened using PCR. The PCR 
primers are homologous to regions of the ubiquitin-ribosomal 
fusion cDNA (see Garbarino J., et al., Plant Molecular Biology 20: 
235 (1992); Garbarino J. and Belknap W., Plant Molecular Biology 
24: 119 (1994); both of which are hereby incorporated by reference 
herein in their entirety). A primer 5' to the beginning ATG of 
ubiquitin and a primer complementary to a sequence near the 5' end 
of the ribosomal protein are used. 

The library is plated in 22 aliquots containing approximately 
0.5x10^5 pfu (plaque forming units) each on an E. coli lawn. A plug 
is taken from each of the 22 resulting plaques and the eluant from 
each is subjected to PCR under standard conditions. The PCR 
products are run on agarose gels. The gels are then blotted and 
probed with the ubiquitin coding region of the ubiquitin-ribosomal 
fusion cDNA according to standard conditions. Two of the plugs 
produce PCR products that hybridize to the cDNA probe. Both of 
these are the correct size for the predicted ubiquitin-ribosomal 
fusion genomic fragment. 

The eluants from these two plugs are then plated and screened 
with the ubiquitin coding region of the ubiquitin-ribosomal fusion 
cDNA according to standard conditions. For verification, the 
positive plaques from the initial screen are replated and screened 
with a probe containing both the ribosomal protein-coding region 
and the 3' end of the potato ubiquitin-ribosomal fusion cDNA. 

The genomic clones are sequenced using Sequenase version 2.0 
(United States Biochemical Corporation) or the Promega fmol DNA 
Sequencing System under standard conditions. A genomic clone 
containing both the ubiquitin-ribosomal fusion promoter region and 
the ubiquitin-ribosomal fusion coding region is identified. 

A chimeric gene is then constructed with a portion of the 
potato ubiquitin-ribosomal fusion genomic clone ligated to a lytic 
peptide gene. PCR is used to generate the Ubi3 promoter and 
ubiquitin portion of the chimeric gene. The Ubi3 promoter region 
includes the 920 bp promoter region upstream of the ubiquitin ATG, 
and the ubiquitin polypeptide coding region is 228 bp plus 6 bp of 
a BamHI restriction site at the 3' end (SEQ ID NO. 93). The 
primers contain BamHI restriction sites and are homologous to the 
5' end of the Ubi3 promoter and to the 3' end of the ubiquitin 
polypeptide coding region. The ubiquitin-ribosomal fusion genomic 
clone is used as the amplification template. This insert is first 
sub-cloned into the plasmid pCGN1547, as described in Garbarino et 
al., Plant Molecular Biology 24: 119 (1994). The Ubi3 insert is 
then isolated from pCGN1547 using the BamHI sites and ligated into 
pUC19 under standard conditions. Transformation of E. coli is 
done according to standard conditions and correct sub-clones are 
confirmed by mini-prep or PCR DNA analysis. This plasmid is 
designated pUCUbi3. 
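
The lengths stated above fix the coordinates of the Ubi3 insert. As a hedged illustration only, the following Python sketch derives 1-based coordinates from the 920 bp promoter, the 228 bp ubiquitin coding region, and the 6 bp BamHI site, and checks that the appended site keeps the reading frame. The function name and the dictionary keys are hypothetical; the three lengths are the only values taken from the text.

```python
def insert_layout(promoter_bp: int, cds_bp: int, site_bp: int) -> dict:
    """Return 1-based coordinates for promoter + coding region + 3' site.

    Hypothetical helper for illustration; it assumes the three elements
    are contiguous with no extra linker bases between them.
    """
    cds_start = promoter_bp + 1          # first base of the ATG
    cds_end = promoter_bp + cds_bp       # last base of the coding region
    insert_end = cds_end + site_bp       # last base of the appended site
    return {
        "cds_start": cds_start,
        "cds_end": cds_end,
        "insert_end": insert_end,
        "codons_with_site": (cds_bp + site_bp) // 3,
        "in_frame": (cds_bp + site_bp) % 3 == 0,
    }


# Values stated in Example 2 for the Ubi3 insert.
ubi3 = insert_layout(promoter_bp=920, cds_bp=228, site_bp=6)
print(ubi3)
```

Under the contiguity assumption, the coding region ends at base 1148 and the insert at base 1154, which agrees with the running coordinates printed in Table 2, and the 234 coding bases form 78 whole codons.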
A nucleotide fragment coding for the lytic peptide 
(corresponding to the amino acid sequence SEQ ID NO. 98) is 
synthesized using a nucleic acid synthesizer, adding a stop codon 
to the 3' end, and used as a PCR template. The 5' PCR primer 
homologous to the lytic peptide nucleotide sequence contains a 
BamHI site, and the 3' primer contains an XbaI site. These sites 
are used to sub-clone the PCR generated insert into pUC19. A 
nopaline synthase polyadenylation signal (NOS3') is then cloned 3' 
to the lytic peptide sequence. Following sequence analysis, the 
BamHI insert containing the Ubi3 promoter and ubiquitin coding 
region (SEQ ID NO. 93) is cloned 5' to the lytic peptide. 
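
The relationship between the synthesized fragment and SEQ ID NO. 98 can be checked by translation. The sketch below is illustrative only: the coding sequence is transcribed from the lytic peptide portion of Table 2, the standard genetic code is built in the conventional TCAG order, and the flanking sites are modelled as bare BamHI (GGATCC) and XbaI (TCTAGA) recognition sequences. The actual primers may carry additional flanking bases, so the `pcr_product` layout is an assumption, as are all function and variable names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Lytic peptide coding sequence plus stop codon, as printed in Table 2.
LYTIC_ORF = (
    "GCTGTTAAAAGAGTGGGTCGTAGGTTGAAAAAGTTG"
    "GACCGTAAGATTGATAGGTTAGGAGTTGATTTTTGA"
)
SEQ_ID_NO_98 = "AVKRVGRRLKKLDRKIDRLGVDF"  # one-letter form of SEQ ID NO. 98

# Assumed minimal layout: BamHI site, coding sequence with stop, XbaI site.
BAMHI, XBAI = "GGATCC", "TCTAGA"
pcr_product = BAMHI + LYTIC_ORF + XBAI

print(translate(LYTIC_ORF) == SEQ_ID_NO_98)
print(translate(BAMHI))            # residues contributed by the BamHI site
print(pcr_product.index(XBAI))     # 0-based offset of the XbaI site
```

In this model the BamHI site is read in frame, so it contributes two residues between the ubiquitin carboxyl terminus and the first residue of the lytic peptide, as reflected in the Table 2 translation.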

After transforming E. coli under standard conditions, pUC19 
sub-clones are selected for mini-prep or PCR DNA analysis 
according to standard conditions. The direction of the promoter 
is confirmed and the junction sequences are verified by sequencing 
according to standard conditions. The resulting Ubi3 
ubiquitin-lytic peptide fusion gene construct corresponds to SEQ 
ID NO. 92. Unlike previous cloning attempts using the CaMV35S 
promoter, as described in the Background section, the sequence 
does not reveal any point mutations in the lytic peptide 
sub-clones. The plasmid is stable in the E. coli host and does not 
inhibit its growth. 

The resulting pUC19 recombinant plasmid is shown in the 
plasmid map in Figure 1. The sequence for the Ubi3-ubiquitin 
insert containing the ubiquitin-ribosomal fusion gene promoter and 
the ubiquitin coding region corresponds to SEQ ID NO. 93 in Table 
2 below. The sequence for the chimeric Ubi3 ubiquitin-lytic 
peptide fusion gene construct corresponds to SEQ ID NO. 92 in 
Table 2 below. This plasmid is designated as pUCUbi3-LP98. 

The entire Ubi3 ubiquitin-lytic peptide fusion gene 
construct, including the polyadenylation site, is isolated from 
pUC19 as an Asp718/HindIII restriction fragment and sub-cloned 
into the pCGN1547 Agrobacterium vector for use in plant 
transformation (see McBride, et al., Plant Molecular Biology 14: 
269 (1990)). This plasmid is designated as pCGNUbi3-LP98. 

TABLE 2: NUCLEOTIDE SEQUENCE OF POTATO UBIQUITIN-RIBOSOMAL FUSION 
PROMOTER (UBI3) AND UBIQUITIN CODING REGION INSERT, AND 
UBIQUITIN-LYTIC PEPTIDE FUSION GENE CONSTRUCT 



SEQ ID NO. 92 

CCAAAGCACA TACTTATCGA TTTAAATTTC ATCGAAGAGA TTAATATCGA  50 

ATAATCATAT ACATACTTTA AATACATAAC AAATTTTAAA TACATATATC  100 

TGGTATATAA TTAATTTTTT AAAGTCATGA AGTATGTATC AAATACACAT  150 

ATGGAAAAAA TTAACTATTC ATAATTTAAA AAATAGAAAA GATACATCTA  200 

GTGAAATTAG GTGCATGTAT CAAATACATT AGGAAAAGGG CATATATCTT  250 

GATCTAGATA ATTAACGATT TTGATTTATG TATAATTTCC AAATGAAGGT  300 

TTATATCTAC TTCAGAAATA ACAATATACT TTTATCAGAA CATTCAACAA  350 

AGCAACAACC AACTAGAGTG AAAAATACAC ATTGTTCTCT AGACATACAA  400 

AATTGAGAAA AGAATCTCAA AATTTAGAGA AACAAATCTG AATTTCTAGA  450 

AGAAAAAAAT AATTATGCAC TTTGCTATTG CTCGAAAAAT AAATGAAAGA  500 

AATTAGACTT TTTTAAAAGA TGTTAGACTA GATATACTCA AAAGCTATTA  550 

AAGGAGTAAT ATTCTTCTTA CATTAAGTAT TTTAGTTACA GTCCTGTAAT  600 

TAAAGACACA TTTTAGATTG TATCTAAACT TAAATGTATC TAGAATACAT  650 

ATATTTGAAT GCATCATATA CATGTATCCG ACACACCAAT TCTCATAAAA  700 

AACGTAATAT CCTAAACTAA TTTATCCTTC AAGTCAACTT AAGCCCAATA  750 

TACATTTTCA TCTCTAAAGG CCCAAGTGGC ACAAAATGTC AGGCCCAATT  800 

ACGAAGAAAA GGGCTTGTAA AACCCTAATA AAGTGGCACT GGCAGAGCTT  850 

ACACTCTCAT TCCATCAACA AAGAAACCCT AAAAGCCGCA GCGCCACTGA  900 

TTTCTCTCCT CCAGGCGAAG ATG CAG ATC TTC GTG AAG ACC TTA  944 
                      Met Gln Ile Phe Val Lys Thr Leu 
                      1               5 

ACG GGG AAG ACG ATC ACC CTA GAG GTT GAG TCT TCC GAC ACC  986 
Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr 
    10                  15                  20 

ATC GAC AAT GTC AAA GCC AAG ATC CAG GAC AAG GAA GGG ATT  1028 
Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile 
        25                  30                  35 

CCC CCA GAC CAG CAG CGT TTG ATT TTC GCC GGA AAG CAG CTT  1070 
Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu 
            40                  45                  50 

GAG GAT GGT CGT ACT CTT GCC GAC TAC AAC ATC CAG AAG GAG  1112 
Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu 
                55                  60 

TCA ACT CTC CAT CTC GTG CTC CGT CTC CGT GGT GGT  1148 
Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65                  70                  75 

GGA TCC GCT GTT AAA AGA GTG GGT CGT AGG TTG AAA AAG TTG  1190 
Gly Ser Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
            80                  85                  90 

GAC CGT AAG ATT GAT AGG TTA GGA GTT GAT TTT TGATC  1228 
Asp Arg Lys Ile Asp Arg Leu Gly Val Asp Phe 
                95                  100 

SEQ ID NO. 93 

CCAAAGCACA TACTTATCGA TTTAAATTTC ATCGAAGAGA TTAATATCGA  50 

ATAATCATAT ACATACTTTA AATACATAAC AAATTTTAAA TACATATATC  100 

TGGTATATAA TTAATTTTTT AAAGTCATGA AGTATGTATC AAATACACAT  150 

ATGGAAAAAA TTAACTATTC ATAATTTAAA AAATAGAAAA GATACATCTA  200 

GTGAAATTAG GTGCATGTAT CAAATACATT AGGAAAAGGG CATATATCTT  250 

GATCTAGATA ATTAACGATT TTGATTTATG TATAATTTCC AAATGAAGGT  300 

TTATATCTAC TTCAGAAATA ACAATATACT TTTATCAGAA CATTCAACAA  350 

AGCAACAACC AACTAGAGTG AAAAATACAC ATTGTTCTCT AGACATACAA  400 

AATTGAGAAA AGAATCTCAA AATTTAGAGA AACAAATCTG AATTTCTAGA  450 

AGAAAAAAAT AATTATGCAC TTTGCTATTG CTCGAAAAAT AAATGAAAGA  500 

AATTAGACTT TTTTAAAAGA TGTTAGACTA GATATACTCA AAAGCTATTA  550 

AAGGAGTAAT ATTCTTCTTA CATTAAGTAT TTTAGTTACA GTCCTGTAAT  600 

TAAAGACACA TTTTAGATTG TATCTAAACT TAAATGTATC TAGAATACAT  650 

ATATTTGAAT GCATCATATA CATGTATCCG ACACACCAAT TCTCATAAAA  700 

AACGTAATAT CCTAAACTAA TTTATCCTTC AAGTCAACTT AAGCCCAATA  750 

TACATTTTCA TCTCTAAAGG CCCAAGTGGC ACAAAATGTC AGGCCCAATT  800 

ACGAAGAAAA GGGCTTGTAA AACCCTAATA AAGTGGCACT GGCAGAGCTT  850 

ACACTCTCAT TCCATCAACA AAGAAACCCT AAAAGCCGCA GCGCCACTGA  900 

TTTCTCTCCT CCAGGCGAAG ATG CAG ATC TTC GTG AAG ACC TTA  944 
                      Met Gln Ile Phe Val Lys Thr Leu 
                      1               5 

ACG GGG AAG ACG ATC ACC CTA GAG GTT GAG TCT TCC GAC ACC  986 
Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr 
    10                  15                  20 

ATC GAC AAT GTC AAA GCC AAG ATC CAG GAC AAG GAA GGG ATT  1028 
Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile 
        25                  30                  35 

CCC CCA GAC CAG CAG CGT TTG ATT TTC GCC GGA AAG CAG CTT  1070 
Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu 
            40                  45                  50 

GAG GAT GGT CGT ACT CTT GCC GAC TAC AAC ATC CAG AAG GAG  1112 
Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu 
                55                  60 

TCA ACT CTC CAT CTC GTG CTC CGT CTC CGT GGT GGT GGA TCC  1154 
Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gly Ser 
65                  70                  75 
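
The running coordinates and the translation printed under the coding rows of SEQ ID NO. 93 can be re-derived from the codons themselves, which is a convenient way to catch transcription errors such as a stray in-frame stop codon. The following Python sketch is illustrative only; the row strings are transcribed from the coding rows of SEQ ID NO. 93 above, the 920 bp offset is the promoter length stated in Example 2, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

PROMOTER_LENGTH = 920  # bases preceding the ubiquitin ATG, per Example 2

# Coding rows of SEQ ID NO. 93 (ubiquitin plus the in-frame BamHI site).
CODING_ROWS = [
    "ATG CAG ATC TTC GTG AAG ACC TTA",
    "ACG GGG AAG ACG ATC ACC CTA GAG GTT GAG TCT TCC GAC ACC",
    "ATC GAC AAT GTC AAA GCC AAG ATC CAG GAC AAG GAA GGG ATT",
    "CCC CCA GAC CAG CAG CGT TTG ATT TTC GCC GGA AAG CAG CTT",
    "GAG GAT GGT CGT ACT CTT GCC GAC TAC AAC ATC CAG AAG GAG",
    "TCA ACT CTC CAT CTC GTG CTC CGT CTC CGT GGT GGT GGA TCC",
]


def row_end_coordinates(rows, offset):
    """Return the coordinate of the last base of each row."""
    ends, position = [], offset
    for row in rows:
        position += len(row.replace(" ", ""))
        ends.append(position)
    return ends


def translate_rows(rows):
    """Translate every codon in every row without stopping at stop codons."""
    return "".join(CODON_TABLE[codon] for row in rows for codon in row.split())


ends = row_end_coordinates(CODING_ROWS, PROMOTER_LENGTH)
protein = translate_rows(CODING_ROWS)
print(ends)
print(protein, len(protein), "*" in protein)
```

Because `translate_rows` does not stop at stop codons, any `*` in its output marks a codon that cannot belong to an uninterrupted coding region and should be compared against the amino acid line printed beneath it.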



Example 3 

Construction of Ubiquitin-lytic Peptide Fusion Plasmids With 
Polyubiquitin Promoter and Intron (Ubi7) 

Exemplary and preferred pUC19 and pCGN1547 plasmid vectors 
containing a potato (Solanum tuberosum) polyubiquitin promoter and 
intron (Ubi7), a region coding for a ubiquitin polypeptide, and a 
gene coding for a lytic peptide are constructed. 

To obtain the genomic clone containing a polyubiquitin 
promoter, intron and ubiquitin polypeptide coding region, a λFIXII 
potato genomic library is first prescreened using PCR as 
described in Example 2 above. The PCR primers are homologous to 
regions of the polyubiquitin cDNA (see Garbarino J., et al., Plant 
Molecular Biology 20: 235 (1992)). A primer homologous to the 5' 
untranslated region of ubiquitin in the polyubiquitin cDNA and a 
primer complementary to the amino terminus of the ubiquitin coding 
region in the polyubiquitin cDNA are used. A genomic clone 
containing the polyubiquitin promoter region, intron, and the 
polyubiquitin coding region is identified. 

A chimeric gene is then constructed with a portion of the 
potato polyubiquitin genomic clone ligated to a lytic peptide 
gene, as described in Example 2. PCR is used to generate the 
Ubi7-ubiquitin portion of the chimeric gene. The Ubi7 promoter 
region includes the 1220 bp promoter and 568 bp intron upstream of 
the ubiquitin ATG, and the ubiquitin polypeptide coding region is 
228 bp plus 6 bp of a BamHI restriction site (SEQ ID NO. 96). 
This plasmid is designated pUCUbi7. 

A nucleotide fragment coding for the lytic peptide 
(corresponding to the amino acid sequence SEQ ID NO. 98) is 
generated as described in Example 2. The resulting Ubi7 
ubiquitin-lytic peptide fusion gene construct corresponds to SEQ 
ID NO. 95. Unlike previous cloning attempts using the CaMV35S 
promoter as described in the Background section, the sequence does 
not reveal any point mutations in the lytic peptide sub-clones. 
The plasmid is stable in the E. coli host and does not inhibit its 
growth. 

The resulting pUC19 recombinant plasmid is shown in the 
plasmid map in Figure 2. The sequence for the PCR insert 
containing the polyubiquitin promoter, intron, and the ubiquitin 
coding region corresponds to SEQ ID NO. 96 in Table 3 below. The 
sequence for the chimeric Ubi7 ubiquitin-lytic peptide fusion gene 
construct corresponds to SEQ ID NO. 95 in Table 3 below. This 
plasmid is designated as pUCUbi7-LP98. 

The entire Ubi7 ubiquitin-lytic peptide fusion gene 
construct, including the polyadenylation site, is isolated from 
pUC19 as an Asp718/partial HindIII restriction fragment (the 
intron has an internal HindIII site) and sub-cloned into the 
pCGN1547 Agrobacterium vector for use in plant transformation. 
This plasmid is designated pCGNUbi7-LP98. 
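
A partial digest is called for because a HindIII recognition sequence (AAGCTT) lies inside the fragment to be recovered. Locating such a site in a listing printed as ten-base blocks requires joining the blocks first, since a site can straddle a block boundary. The Python sketch below is illustrative only: it uses the five blocks that precede coordinate 1150 in Table 3 as printed, the function name is hypothetical, and it makes no claim about whether the site it finds is the internal site the text refers to.

```python
HINDIII_SITE = "AAGCTT"


def find_sites(blocks, first_position, site):
    """Return 1-based coordinates of every occurrence of `site`.

    `blocks` are consecutive printed blocks; `first_position` is the
    coordinate of the first base of the first block.
    """
    sequence = "".join(blocks)
    hits = []
    index = sequence.find(site)
    while index != -1:
        hits.append(first_position + index)
        index = sequence.find(site, index + 1)
    return hits


# Blocks covering coordinates 1101-1150 of Table 3, as printed.
ROW_1101_1150 = [
    "GTTTCGTCTA", "GAGTCGGCCA", "TGCCATCTAT", "AAAATGAAGC", "TTTCTGCACC",
]

print(find_sites(ROW_1101_1150, 1101, HINDIII_SITE))
print(any(HINDIII_SITE in block for block in ROW_1101_1150))
```

The second line of output shows why a block-by-block search is not sufficient: no single block contains the full recognition sequence, yet the joined row does.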
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TABLB 3: NOCLBOTIDB SEQUENCE 07 POTATO POIiYDBIQUITIN PROMOTER 
REGION (OB 17 ) AHD UBIQUITIN CODING REGION INSERT, AND UBIQOTTIN- 
IOTIC PEPTIDE FUSION GENE CONSTRUCT 



5 SBQ IP Nff. 95 





' I " i ■ I *7i r VC* A ZVTY"* 

X X Xt\ IWinlU 




A XftXAAnXAA 


ATATAAATTG 


TCTCAATAAT 


50 




Inn 


nL iAnlAi X 1 


uAAAXV X\_nft 


MIINiail^llV <%lllf II 
XXX lAl\3Ai X 


xxx x Annx xv* 


1 no 

X V u 


10 


nv^i A XaXAXw 


LAAUAUinl 1 




A & & if ■I*l*l B l B l , Ji 
nAnnu XIX X A 


TTl & AM ATT*! 1 

X 1/iiVUiAl X X 






Af* R r r r r7vf^T r r f T' 


TGTTGATGAG 

4U1 AwAA\3aT%w 


GATGACAAGA 


TNTTGGTCAT 

ill A X i 1 i A 


CAATTACATA 


200 


15 




f^AATAfiTA AR 


P AAt "I'HTAAT 


f^TTTTPPATA 

Vj X X A x iwun 


ATGATAATCA 


250 


CAGACACAAN 


NNAAACCCAT 


TTATPA TIT* A 

a ini ini iwi 


CATTGATTGA 


GTITTATATG 


300 




CAATATAGTA 


1 TAATAATAA 


TATTTY^TTAT 


AAAGCAAGAG 


GTCAATTTTT 


350 


20 


TTTTAATTAT 


ACCACGTCAC 


TAAATTATAT 


TTGATAATGT 


AAAACAATTC 


400 




AAATTTTACT 


TAAATATCAT 


GAAATAAACT 


ATTTTTATAA 


CCAAATTACT 


450 


25 


AAATTTTTCC 


AATAAAAAAA 


AGTCATTAAG 


AAGACATAAA 


ATAAATTTGA 


500 


GGTAAANGAG 


T3AAGTCGAC 


TGACTITTTT 


n>m>TTTATC 


ATAAGAAAAT 


550 




AAATTATTAA 


CTTTAACCTA 


ATAAAACACT 


AATATAATTT 




OUU 


:o 


AATACTTACC 


TCTTAGAAAT 


AAGAAAAAGT 


GTTTCTAATA 


GACCCTCAAT 


650 




TTACATTAAA 


TATTTTCAAT 


CAAATTTAAA 


TAACAAATAT 


CAATATGAGG 


700 


35 


TCAATAACAA 


TATCAAAATA 


ATATGAAAAA 


AGAGCAATAC 


ATAATATAAG 


750 


GGACGATTTA 


AGTGCGATTA 


TCAAGGTAGT 


ATTATATCCT 


AATTTGCTAA 


800 




TATTTGNGCT 


CITATATTTA 


AGGTCATGTT 


CATGATAAAC 


TTGAAATGCG 


850 


40 


CTATATTAGA 


GCATATATTA 


AAATAAAAAA 


ATACCTAAAA 


TAAAATTAAG 


900 




TTATTTTTAG 


TATATATTTT 


TTTACATGAC 


CTACATTTTT 


CTGGGTTTTT 


950 


45 


CTAAAGGAGC 


GTGTAAGTGT CGACCTCATT 


CTCCTAATTT 


TCCCCACCAC 


1000 


ATAAAAATTA 


AAAAGGAAAG 


GTAGC TTTTG 


CGTGTTGTTT 


TGGTACACTA 


1050 




CACCTCATTA 


TTACACGTGT 


CCTCATATAA 


TTGGTTAACC 


CTATGAGGCG 


1100 


50 


GTTTCGTCTA 


GAGTCGGCCA TGCCATCTAT 


AAAATGAAGC 


TTTCTGCACC 


1150 




TCA'mTHT 


CATCTTCTAT 


CTGATTTCTA 


TTATAATTTC 


TCTCAATTGC 


1200 



39 



WO 96/03522 



PCT/US95/09338 



CTTCAAATTT CTCTTTAAGG TTAGAATCTT CTCTATTTTT 1240

GGTTTTTGTA TGTTTAGATT CTCGAATTAG CTAATCAGGC GCTGTTATAG 1290

CCCTTCCTTT TGAGTCTCTC CTCGGTTGTC TTGATGGAAA AGGCCTAACA 1340

TTTGAGTTTT TTTACGTCTG GTTTGATGGA AAAGGCCTAC AATTGGCCGT 1390

TTTCCCCGTT CGTTTTGATG AAAAAGCCCC TAGTCTGAGA TTTTTTTTCT 1440

GTCGTTCGTT CTAAAGGTTT AAAATTAGAG TTTTTACATT TGTTTGATGA 1490

AAAAGGCCTT AAATTTGAGT TTTTCCGGTT GATTTGATGA AAAAGCCCTA 1540

GAATTTCTGT TTTTCCGTCG GTTTGATTCT GAAGGCCTAA AATTTGAGTT 1590

TCTCCGGCTG TTTTGATGAA AAAGCCCTAA ATTTGAGTTT CTCCGGCTGT 1640

TTTGATGAAA AAGCCCTAAA TTTGAAGTTT TTTCCCCGTG TTTTAGATTG 1690

TTTAGGTTTT AATTCTCGAA TCAGCTAATC AGGGAGTGTG AAAGCCCTAA 1740

ATTGAAGTTT TTTTCGTTGT TCTGATTGTT GTTTTTATGA ATTTGCAG 1788

ATG CAG ATC TTT GTG AAA ACT CTC ACC GGA AAG ACT ATC ACC 1830
Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr
1               5                   10

CTA GAG GTG GAA AGT TCT GAT ACA ATC GAC AAC GTT AAG GCT 1872
Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala
15                  20                  25

AAG ATC CAG GAT AAG GAA GGA ATT CCC CCG GAT CAG CAA AGG 1914
Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg
    30                  35                  40

CTT ATC TTC GCC GGA AAG CAG TTG GAG GAC GGA CGT ACT CTA 1956
Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu
        45                  50                  55

GCT GAT TAC AAC ATC CAG AAG GAG TCT ACC CTC CAT TTG GTG 1998
Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val
            60                  65                  70

CTC CGT CTA CGT GGA GGT GGA TCC GCT GTT AAA AGA GTG GGT 2040
Leu Arg Leu Arg Gly Gly Gly Ser Ala Val Lys Arg Val Gly
                75                  80

CGT AGG TTG AAA AAG TTG GAC CGT AAG ATT GAT AGG TTA GGA 2082
Arg Arg Leu Lys Lys Leu Asp Arg Lys Ile Asp Arg Leu Gly
85                  90                  95

GTT GAT TTT TGATCTAGAG TCGACCGATC CCCCGAATTT CCCCGA 2127
Val Asp Phe
    100


SEO TD NTl Qfi 
TTTATCAATC AGATTTGAAC 


X TX T"A X X T»7A X 

AiAlAAAlAA 


xTtx'I '& x AT* I if"* 


lv- 1V.AA1 AAJ. 


CCA 




TCTACATTAA 


ACTAATATTT 


/"» X X XHV»>Ty* X TV 

Vj AAA 1\- 1VAA 


111 lAjAiAl X 


111 1AAA1 J.V- 


±00 


10 


ACTTTATATC 


CAAGACAATT 


INLANv. 1 1\.A 


AAAAkj 1 1 1 1 A 


1 lAAANAl 1 1 


1 EA 




ACATTAGTTT 


TGTTGATGAG 


IjATTjALAAuA 


-LTJ 1 lvio I\. A 1 


x ja nvrii r" x mx 

LAA 1 lA^AlA 


inn 
i 00 


15 


TACCCAAATT 


GAATAGTAAG 


CAACl IwiAT 




x nv' ta mn x mn x 
A ioA 1 AA l\iA 




CAGACACAAN 


NNAAACCCAT 


TTATTATTCA 


CAI lviAl JAjA 


GTTTTATATG 


■5 ft #\ 

300 




CAATATAGTA 


ATAATAATAA 


TATTTCTTAT 


AAAvjCAALjACj 


GTa-AA'1"1"1"1"1' 


left 

350 


it V 


TTTTAATTAT 


ACCACGTCAC 


TAAATTATAT 


"vtv^ yi rrix Tinrvm 
. iunlAAlb 1 


X A 7i AfTA JAM *1 

AAAAL. AA 1 1 V 


A Aft 
900 




AAATTTTACT 


TAAATATCAT 


r* ta b. a t* 7v xf»ip 
LjAAAIAAAL. 1 


* •TTT'T' 1*1 IX *"ITTA 7A 
-TV I 111 lAlAft 


jT* A A AM " M APT 1 
LAfth 1 1 AU 1 


4311 


25 


AAATTTTTCC 


AATAAAAAAA 


A<j IvAi 1 AAVj 


S-i^APATAAA 
.-vnbnLn 1 AAA 


XTA ta ]V?|«irX 
A1AAA1 1 luA 


enn 


GGTAAANGAG 


TGAAGTCGAC 


1 viAL. 1 1 i I i 1 


* 1 1 1 1 1 inlv. 


7iTA AO A & Ml' 
AlAAvaAAAAl 






- — ^ iAl inn 


Lil lAnLLln 


J. rpX X X Tip 7\f— *T» 
AlAAAAL.nL. i 


i A.'PTA'PA A'l" i ** i * 
.%ninlnnl 1 i. 


P ATY^R A ATPT 
luVjnn 1\— 1 


ouu 


30 


AATACTTACC 


TCTTAGAAAT 


A A(3 AA AA ART 


rZ'I B l > fY" ,r TAA , T 1 A 




650 




TTACATTAAA TATTTTCAAT CAAATTTAAA TAACAAATAT CAATATGAGG 700

TCAATAACAA TATCAAAATA ATATGAAAAA AGAGCAATAC ATAATATAAG 750

GGACGATTTA AGTGCGATTA TCAAGGTAGT ATTATATCCT AATTTGCTAA 800

TATTTGNGCT CTTATATTTA AGGTCATGTT CATGATAAAC TTGAAATGCG 850

CTATATTAGA GCATATATTA AAATAAAAAA ATACCTAAAA TAAAATTAAG 900

TTATTTTTAG TATATATTTT TTTACATGAC CTACATTTTT CTGGGTTTTT 950

CTAAAGGAGC GTGTAAGTGT CGACCTCATT CTCCTAATTT TCCCCACCAC 1000

ATAAAAATTA AAAAGGAAAG GTAGCTTTTG CGTGTTGTTT TGGTACACTA 1050

CACCTCATTA TTACACGTGT CCTCATATAA TTGGTTAACC CTATGAGGCG 1100

GTTTCGTCTA GAGTCGGCCA TGCCATCTAT AAAATGAAGC TTTCTGCACC 1150

TCATTTTTTT CATCTTCTAT CTGATTTCTA TTATAATTTC TCTCAATTGC 1200

CTTCAAATTT CTCTTTAAGG TTAGAATCTT CTCTATTTTT 

GGTTTTTGTA TGTTTAGATT CTCGAATTAG CTAATCAGGC 

CCCTTCCTTT TGAGTCTCTC CTCGGTTGTC TTGATGGAAA 

TTTGAGTTTT TTTACGTCTG GTTTGATGGA AAAGGCCTAC 

TTTCCCCGTT CGTTTTGATG AAAAAGCCCC TAGTTTGAGA 

GTCGTTCGTT CTAAAGGTTT AAAATTAGAG TTTTTACATT 

AAAAGGCCTT AAATTTGAGT TTTTCCGGTT GATTTGATGA 

GAATTTGTGT TTTTCCGTCG GTTTGATTCT GAAGGCCTAA

TCTCCGGCTG TTTTGATGAA AAAGCCCTAA ATTTGAGTTT 

TTTGATGAAA AAGCCCTAAA TTTGAAGTTT TTTCCCCGTG 

TTTAGGTTTT AATTCTCGAA TCAGCTAATC AGGGAGTGTG 

ATTGAAGTTT TTTTCGTTGT TCTGATTGTT GTTTTTATGA 

ATG CAG ATC TTT GTG AAA ACT CTC ACC GGA AAG ACT ATC ACC 1830
Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr
1               5                   10

CTA GAG GTG GAA AGT TCT GAT ACA ATC GAC AAC GTT AAG GCT 1872
Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala
15                  20                  25

AAG ATC CAG GAT AAG GAA GGA ATT CCC CCG GAT CAG CAA AGG 1914
Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg
    30                  35                  40

CTT ATC TTC GCC GGA AAG CAG TTG GAG GAC GGA CGT ACT CTA 1956
Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu
        45                  50                  55

GCT GAT TAC AAC ATC CAG AAG GAG TCT ACC CTC CAT TTG GTG 1998
Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val
            60                  65                  70

CTC CGT CTA CGT GGA GGT GGA TCC 2022
Leu Arg Leu Arg Gly Gly Gly Ser
                75
Example 4

Construction of Ubiquitin-Lytic Peptide Fusion Gene Plasmid Vectors

pUC19 and pCGN1547 plasmid vectors containing a potato
(Solanum tuberosum) Ubi3 promoter, a region coding for a ubiquitin
polypeptide, and a gene coding for a lytic peptide are constructed
according to Example 2. Each plasmid respectively contains one
lytic peptide nucleotide sequence coding for an amino acid
sequence corresponding to SEQ ID NO. 1, 7, 15, 21, 30, 39, 43, 52,
83, 86, 88, 90, and 91. The resultant pUC19 Ubi3 ubiquitin-lytic
peptide recombinant plasmids are designated as follows:
pUCUbi3-LP1, pUCUbi3-LP7, pUCUbi3-LP15, pUCUbi3-LP21, pUCUbi3-LP30,
pUCUbi3-LP39, pUCUbi3-LP43, pUCUbi3-LP52, pUCUbi3-LP83,
pUCUbi3-LP86, pUCUbi3-LP88, pUCUbi3-LP90, and pUCUbi3-LP91. The
resultant pCGN1547 Ubi3 ubiquitin-lytic peptide recombinant
plasmids are designated as follows: pCGNUbi3-LP1, pCGNUbi3-LP7,
pCGNUbi3-LP15, pCGNUbi3-LP21, pCGNUbi3-LP30, pCGNUbi3-LP39,
pCGNUbi3-LP43, pCGNUbi3-LP52, pCGNUbi3-LP83, pCGNUbi3-LP86,
pCGNUbi3-LP88, pCGNUbi3-LP90, and pCGNUbi3-LP91.

pUC19 and pCGN1547 plasmid vectors containing a potato
(Solanum tuberosum) Ubi7 promoter and intron, a region coding for
a ubiquitin polypeptide, and a gene coding for a lytic peptide are
constructed according to Example 3. Each plasmid respectively
contains one lytic peptide nucleotide sequence coding for an amino
acid sequence corresponding to SEQ ID NO. 1, 7, 15, 21, 30, 39,
43, 52, 83, 86, 88, 90, and 91. The resultant pUC19 Ubi7
ubiquitin-lytic peptide recombinant plasmids are designated as
follows: pUCUbi7-LP1, pUCUbi7-LP7, pUCUbi7-LP15, pUCUbi7-LP21,
pUCUbi7-LP30, pUCUbi7-LP39, pUCUbi7-LP43, pUCUbi7-LP52,
pUCUbi7-LP83, pUCUbi7-LP86, pUCUbi7-LP88, pUCUbi7-LP90, and
pUCUbi7-LP91. The resultant pCGN1547 Ubi7 ubiquitin-lytic peptide
recombinant plasmids are designated as follows: pCGNUbi7-LP1,
pCGNUbi7-LP7, pCGNUbi7-LP15, pCGNUbi7-LP21, pCGNUbi7-LP30,
pCGNUbi7-LP39, pCGNUbi7-LP43, pCGNUbi7-LP52, pCGNUbi7-LP83,
pCGNUbi7-LP86, pCGNUbi7-LP88, pCGNUbi7-LP90, and pCGNUbi7-LP91.
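The naming scheme used for these vectors is regular enough to enumerate programmatically. The following Python sketch is illustrative only: the helper names (`plasmid_names`, `all_designations`) and the dictionary layout are assumptions introduced here, and the SEQ ID NO. values are the ones that appear in the plasmid designations (LP1 through LP91).

```python
# Illustrative sketch: enumerate the plasmid designations for the
# ubiquitin-lytic peptide vectors. Helper names are hypothetical.

# SEQ ID NO. values of the lytic peptides, as reflected in the LP suffixes.
SEQ_IDS = [1, 7, 15, 21, 30, 39, 43, 52, 83, 86, 88, 90, 91]

# Vector backbone -> prefix used in the plasmid designation.
BACKBONES = {"pUC19": "pUC", "pCGN1547": "pCGN"}

# Potato ubiquitin promoters used in the constructs.
PROMOTERS = ["Ubi3", "Ubi7"]


def plasmid_names(prefix, promoter, seq_ids=SEQ_IDS):
    """Return designations such as 'pUCUbi3-LP1' for one backbone/promoter."""
    return [f"{prefix}{promoter}-LP{n}" for n in seq_ids]


def all_designations():
    """Map (backbone, promoter) to its list of plasmid designations."""
    return {
        (backbone, promoter): plasmid_names(prefix, promoter)
        for backbone, prefix in BACKBONES.items()
        for promoter in PROMOTERS
    }


if __name__ == "__main__":
    for (backbone, promoter), names in all_designations().items():
        print(f"{backbone} / {promoter}: {', '.join(names)}")
```

Each backbone and promoter pairing yields thirteen designations, one per lytic peptide, for fifty-two plasmids in total.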
Example 5

Construction of GUS-Ubiquitin Fusion Gene Recombinant DNA
Molecules and Ubiquitin Promoter-GUS Recombinant DNA Molecules

Two chimeric genes containing a β-glucuronidase (GUS)
reporter gene and the Ubi3 promoter were constructed in pCGN1547
plasmid vectors according to Garbarino, J., and Belknap, W., Plant
Molecular Biology 24: 119 (1994), hereby incorporated by reference
in its entirety. The first vector contains the 920 bp Ubi3
promoter ligated to the GUS gene, and expresses the GUS protein.
This plasmid is designated pCGNUbi3-GUS. The second vector
contains the 920 bp Ubi3 promoter and 228 bp ubiquitin coding
region ligated in frame to the GUS gene. This plasmid expresses a
ubiquitin-GUS fusion polypeptide and is designated pCGNUbi3-GUSf.

Two chimeric genes containing a β-glucuronidase (GUS)
reporter gene and the Ubi7 promoter minus the intron region were
constructed in pCGN1547 plasmid vectors using PCR, as described in
Example 3 and in Garbarino, J., and Belknap, W., Plant Molecular
Biology 24: 119 (1994). The first vector contains a 1156 bp Ubi7
promoter region insert, including the 5' untranslated region of
ubiquitin, ligated to the GUS gene. This plasmid does not contain
the Ubi7 intron and expresses the GUS protein. This plasmid is
designated pCGNUbi7-GUS. The second vector contains the 1156 bp
Ubi7 ubiquitin promoter from pCGNUbi7-GUS and the 228 bp ubiquitin
coding region fused in frame to the GUS reporter gene. This
plasmid expresses a ubiquitin-GUS fusion polypeptide and is
designated pCGNUbi7-GUSf.

Example 6 

Plant Transformation and GUS Gene Expression 



The chimeric plasmids pCGNUbi3-GUS, pCGNUbi3-GUSf, pCGNUbi7-GUS,
and pCGNUbi7-GUSf from Example 5 are introduced into the
potato (Solanum tuberosum) using Agrobacterium-mediated
transformation according to Garbarino, J., and Belknap, W., Plant
Molecular Biology 24: 119 (1994). The strain of Agrobacterium
tumefaciens used for transformation (PC2760, see An, G., et al.,
EMBO J. 4: 277 (1985)) harbors the disarmed Ti plasmid pAL4404
(see Hoekema, A., et al., Nature 303: 179 (1983)). Plant
transformation is carried out as previously described in Snyder,
G.W., et al., Plant Cell Rep 12: 324 (1993), except that 1 mg/l
silver thiosulfate is added to the stage II transformation medium
(see Chang, H.H., et al., Bot Bull Acad Sci 32: 63 (1991)).

Expression of the ubiquitin-GUS fusion polypeptide and mRNA
products and the GUS protein alone is examined by northern and
western analysis, as described in Garbarino, J., and Belknap, W.,
Plant Molecular Biology 24: 119 (1994). GUS protein expression is
examined in the transgenic plants using western analysis.

Although there is a wide range of activity among individual
clones, plants containing the ubiquitin-GUS fusion polypeptide
generally give 5-10 fold higher expression than plants
containing the GUS protein alone. This higher level of protein
expression corresponds to similarly elevated mRNA transcription
levels for the ubiquitin-GUS fusion constructs, as shown by
northern analysis (described in Garbarino et al., Plant Molecular
Biology 24: 119 (1994)). Western analysis also shows that the
ubiquitin-GUS fusion polypeptide is appropriately processed by
endogenous ubiquitin hydrolases to produce free GUS protein.

GUS protein activity is measured as described by Jefferson,
R.A., et al., EMBO J. 6: 3901 (1987). Table 4 below shows a
comparison of the GUS activities in plants transformed with
pCGNUbi3-GUS (ubi-) and plants transformed with pCGNUbi3-GUSf
(ubi+). The activity is measured in nmoles methyl umbelliferone
(MU) per minute per milligram of protein. Methyl umbelliferone is
the fluorescent product of the GUS enzymatic reaction.






TABLE 4: COMPARISON OF GUS PROTEIN ACTIVITY IN PLANTS TRANSFORMED
WITH THE UBI3 PROMOTER WITH (+UBI) AND WITHOUT (-UBI) UBIQUITIN
POLYPEPTIDE FUSION

             GUS Activity (nmoles MU/min/mg protein)

Construct    Leaf Meristem   2nd Leaf     5th Leaf     Senescent Leaf   Tuber

3.2-ubi      6.31±0.74       2.51±0.52    1.79±0.22    5.42±1.24        3.26±0.27
8.1-ubi      25.8±2.08       9.98±2.10    6.34±1.00    19.20±6.11       14.2±1.6
3.5+ubi      94.8±12.6       60.3±25.1    32.7±8.71    50.1±11.6        37.6±10.4
9.8+ubi      33.3±0.5        18.9±2.75    9.74±0.99    22.7±3.57        20.7±3.45
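
One possible way to summarize the values in Table 4 is to compare the mean activity of the two +ubi clones with the mean of the two -ubi clones for each tissue. The Python sketch below does this using only the central values from the table. Pooling the clones by simple averaging is an assumption made here for illustration, not the analysis used by the authors, and the function names are hypothetical.

```python
# Illustrative sketch: per-tissue ratio of mean GUS activity in +ubi clones
# to mean GUS activity in -ubi clones, using central values from Table 4.
# Averaging the two clones of each type is an assumption of this sketch.

TISSUES = ["Leaf Meristem", "2nd Leaf", "5th Leaf", "Senescent Leaf", "Tuber"]

# nmoles MU/min/mg protein; the "±" error terms are omitted.
ACTIVITY = {
    "3.2-ubi": [6.31, 2.51, 1.79, 5.42, 3.26],
    "8.1-ubi": [25.8, 9.98, 6.34, 19.20, 14.2],
    "3.5+ubi": [94.8, 60.3, 32.7, 50.1, 37.6],
    "9.8+ubi": [33.3, 18.9, 9.74, 22.7, 20.7],
}


def group_mean(suffix):
    """Mean activity per tissue over all clones whose name ends in suffix."""
    rows = [values for name, values in ACTIVITY.items() if name.endswith(suffix)]
    return [sum(column) / len(column) for column in zip(*rows)]


def fold_change():
    """Map each tissue to mean(+ubi) / mean(-ubi)."""
    plus = group_mean("+ubi")
    minus = group_mean("-ubi")
    return {tissue: p / m for tissue, p, m in zip(TISSUES, plus, minus)}


if __name__ == "__main__":
    for tissue, ratio in fold_change().items():
        print(f"{tissue:15s} {ratio:.2f}-fold")
```

Because individual clones vary widely, a ratio computed this way depends on which clones are pooled; comparing specific clone pairs would give different values.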



Example 7

Plant Transformation and Ubiquitin-Lytic Peptide Gene Expression



The chimeric plasmids pCGNUbi3-LP98 from Example 2 and
pCGNUbi7-LP98 from Example 3 are introduced into the potato
(Solanum tuberosum) using Agrobacterium-mediated transformation
according to Garbarino, J., and Belknap, W., Plant Molecular
Biology 24: 119 (1994). The strain of Agrobacterium tumefaciens
used for transformation (PC2760, see An, G., et al., EMBO J. 4:
277 (1985)) harbors the disarmed Ti plasmid pAL4404 (see Hoekema,
A., et al., Nature 303: 179 (1983)). Plant transformation is
carried out as previously described in Snyder, G.W., et al., Plant
Cell Rep 12: 324 (1993), except that 1 mg/l silver thiosulfate is
added to the stage II transformation medium (see Chang, H.H., et
al., Bot Bull Acad Sci 32: 63 (1991)).

Expression of the ubiquitin-lytic peptide fusion polypeptide
and mRNA products is examined by northern and western analysis, as
described in Example 6 and Garbarino, J., and Belknap, W., Plant
Molecular Biology 24: 119 (1994). Northern analysis shows that
ubiquitin-lytic peptide mRNA is transcribed from the gene
construct in the transgenic plants. Western analysis shows that
the ubiquitin-lytic peptide fusion polypeptide is appropriately
processed by endogenous ubiquitin hydrolases to produce free lytic
peptide.

Example 8 

Control of Ubi3/Ubi7 Promoter Activity

mRNA expression from the cloned Ubi3 promoter was examined
before and after wounding to determine if the cloned Ubi3 promoter
is wound inducible in transformed plants (see Garbarino, J., and
Belknap, W., Plant Molecular Biology 24: 119 (1994)). Northern
analysis comparing endogenous Ubi3 mRNA expression levels to
pCGNUbi3-GUS and pCGNUbi3-GUSf mRNA expression levels in
transformed plants (see Example 5) shows that while the endogenous
Ubi3 mRNA transcription increases upon wounding, transcription
from the recombinant Ubi3 plasmids does not. Thus the recombinant
Ubi3 promoter does not have the wound-inducible characteristic of
the endogenous Ubi3 promoter. This result suggests that the 920
bp of upstream sequence cloned in the Ubi3 genomic clone is not
sufficient to obtain wound-dependent activation of the promoter.
The promoter is instead constitutive; however, it still
demonstrates developmental regulation, as shown in Table 4 above.

In contrast, the cloned Ubi7 promoter retains its wound-dependent
induction. Northern analysis comparing the endogenous
Ubi7 mRNA expression levels to the expression levels from
pCGNUbi7-GUS and pCGNUbi7-GUSf in transformed plants (see Example
5) shows that both the endogenous and the cloned Ubi7 promoter
have wound-dependent activation.
DEPOSIT INFORMATION

E. coli cultures, each respectively transformed with pUCUbi7-LP98
(Local Accession No. PBT-0273), pUCUbi3-LP98 (Local Accession
No. PBT-0276), pUCUbi7 (Local Accession No. PBT-0277), and pUCUbi3
(Local Accession No. PBT-0234), were deposited in the Agricultural
Research Service Culture Collection (NRRL). The depository is
located at 1815 North University Street, Peoria, IL, 61604.
SEQUENCE LISTING

(1) GENERAL INFORMATION:



(i) APPLICANT: 

(A) NAME: DEMETER BIOTECHNOLOGIES, LTD. 

(B) STREET: 905 W. MAIN ST., BRIGHTLEAF SQUARE STE 19-D

(C) CITY: DURHAM

(D) STATE: NORTH CAROLINA

(E) COUNTRY: USA

(F) POSTAL CODE (ZIP): 27701

(G) TELEPHONE: (919)682-7181

(H) TELEFAX: (919)682-8340

(ii) TITLE OF INVENTION: UBIQUITIN-LYTIC PEPTIDE FUSION GENE 
CONSTRUCTS, PROTEIN PRODUCTS DERIVING THEREFROM, AND 
METHODS OF MAKING AND USING THE SAME 



(iii) NUMBER OF SEQUENCES: 98 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: WORDPERFECT 5.1+ 



(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 21-JUL-1994 

(vi) PRIOR APPLICATION DATA: 08/279,472

(A) APPLICATION NUMBER: 08/279,472 

(B) FILING DATE: 22-JUL-1994 
(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys
1 5 10 15

Lys Ala Val Lys Lys Ala Val Lys Lys Lys Lys 
20 25 
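
The LENGTH field of a listing entry can be cross-checked against the residues actually listed. The Python sketch below converts the three-letter sequence of SEQ ID NO: 1 to one-letter code, counts the residues, and estimates the net charge by counting Lys and Arg against Asp and Glu. This charge estimate is a rough assumption introduced here (it ignores His, the termini, and pH effects), and the function names are hypothetical.

```python
# Illustrative sketch: check the stated LENGTH of SEQ ID NO: 1 and estimate
# its net charge by simple residue counting (an approximation).

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Residues of SEQ ID NO: 1 as given in the sequence listing.
SEQ_ID_1 = (
    "Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys "
    "Lys Ala Val Lys Lys Ala Val Lys Lys Lys Lys"
)


def to_one_letter(three_letter_sequence):
    """Convert a space-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter_sequence.split())


def net_charge(peptide):
    """Count basic (K, R) minus acidic (D, E) residues; a crude estimate."""
    basic = sum(peptide.count(c) for c in "KR")
    acidic = sum(peptide.count(c) for c in "DE")
    return basic - acidic


if __name__ == "__main__":
    peptide = to_one_letter(SEQ_ID_1)
    print(peptide, len(peptide), net_charge(peptide))
```

The residue count agrees with the stated LENGTH of 27, and the excess of lysine residues is consistent with the overall basic charge described for lytic peptides.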



(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Lys Lys Ala
1 5 10 15

Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala Val Lys Lys Lys Lys 
20 25 30 
(4) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Ala Val Lys
1 5 10 15

Ala Val Lys Lys Ala Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala
20 25 30

Val Lys Lys Lys Lys
35

(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys
1 5 10 15

Lys Ala Val Lys Lys Ala Val
20
(6) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Lys Lys Ala
1 5 10 15

Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala Val 
20 25 

(7) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Phe Ala Val Ala Val Lys Ala Val Ala Val Lys Ala Val Ala Val Lys 
1 5 10 15

Ala Val Lys Lys Ala Val Lys Lys Val Lys Lys Ala Val Lys Lys Ala
20 25 30

Val 
(8) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7

Phe Ala Val Gly Leu Arg Ala Ile Lys Arg Ala Leu Lys Lys Leu Arg
1 5 10 15 

Arg Gly Val Arg Lys Val Ala Lys Arg Lys Arg 
20 25 

(9) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8

Phe Ala Val Gly Leu Arg Ala Ile Lys Arg Ala Leu Lys Lys Leu Arg
1 5 10 15

Arg Gly Val Arg Lys Val Ala 
20 
(10) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
1 5 10 15

Ala Arg Lys Ile Ala Arg Leu Gly Val Ala Phe
20 25 

(11) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23

(B) TYPE: AMINO ACID
(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10

Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Ala Arg Lys Ile
1 5 10 15

Ala Arg Leu Gly Val Ala Phe 
20 
(12) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11

Phe Ala Val Gly Leu Arg Ala Ile Lys Arg Ala Leu Lys Lys Leu Arg
1 5 10 15

Arg Gly Val Arg Lys Val Ala Lys Arg Lys Arg Lys Lys Asp Leu 
20 25 30 



(13) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

Phe Ala Val Gly Leu Arg Ala Ile Lys Arg Ala Leu Lys Lys Leu Arg
1 5 10 15

Arg Gly Val Arg Lys Val Ala Lys Asp Leu
20 25 
(14) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu
1 5 10 15

Ala Arg Lys Ile Ala Arg Leu Gly Val Ala Phe Lys Asp Leu
20 25 30

(15) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 

(B) TYPE: AMINO ACID
(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14

Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Ala Arg Lys Ile
1 5 10 15

Ala Arg Leu Gly Val Ala Phe Lys Asp Leu 
20 25 
(16) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1 5 10 15

Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
20 25 

(17) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val
1 5 10 15

Ala Lys Lys Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Val
20 25 30
(88) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87

Gln Lys Leu Cys Glu Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly
1 5 10 15

Asn Asn Asn Ala Cys Lys Asn Gln Cys Ile Asn Leu Glu Lys Ala Arg
20 25 30

His Gly Ser Cys Asn Tyr Val Phe Pro Ala His Lys
35 40

(89) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88

Gln Arg Val Cys Asp Lys Pro Ser Gly Thr Trp Ser Gly Leu Cys Gly
1 5 10 15

Asn Asn Asn Ala Cys Arg Gln Asn Cys Ile Gln Val Asp Arg Ala Lys
20 25 30

Lys Gly Ser Cys Gln Phe Leu Tyr Pro Ala Lys Lys
35 40 
(90) INFORMATION FOR SEQ ID NO: 89:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 

Gln Lys Leu Cys Gln Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly
1 5 10 15

Asn Asn Asn Ala Cys Lys Asn Gln Cys Ile Arg Leu Glu Lys Ala Arg
20 25 30 

His Gly Ser Cys 
35 

(91) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 

Gln Arg Val Cys Asn Lys Pro Ser Gly Thr Trp Ser Gly Leu Cys Gly
1 5 10 15

Asn Asn Asn Ala Cys Arg Gln Asn Cys Ile Lys Val Asp Arg Ala Lys
20 25 30 

Lys Gly Ser Cys 
35 
(92) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91

Met Leu Glu Glu Leu Phe Glu Glu Met Thr Glu Phe Ile Glu Glu Val
1 5 10 15

Ile Glu Thr Met
20 

(93) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1228 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE STRANDED

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: GENOMIC DNA AND OTHER NUCLEIC ACID 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92

CCAAAGCACA TACTTATCGA TTTAAATTTC ATCGAAGAGA TTAATATCGA 50

ATAATCATAT ACATACTTTA AATACATAAC AAATTTTAAA TACATATATC 100

TGGTATATAA TTAATTTTTT AAAGTCATGA AGTATGTATC AAATACACAT 150

ATGGAAAAAA TTAACTATTC ATAATTTAAA AAATAGAAAA GATACATCTA 200

GTGAAATTAG GTGCATGTAT CAAATACATT AGGAAAAGGG CATATATCTT 250

GATCTAGATA ATTAACGATT TTGATTTATG TATAATTTCC AAATGAAGGT 300

TTATATCTAC TTCAGAAATA ACAATATACT TTTATCAGAA CATTCAACAA 350

AGCAACAACC AACTAGAGTG AAAAATACAC ATTGTTCTCT AGACATACAA 400

AATTGAGAAA AGAATCTCAA AATTTAGAGA AACAAATCTG AATTTCTAGA 450

AGAAAAAAAT AATTATGCAC TTTGCTATTG CTCGAAAAAT AAATGAAAGA 500

AATTAGACTT TTTTAAAAGA TGTTAGACTA GATATACTCA AAAGCTATTA 550

AAGGAGTAAT ATTCTTCTTA CATTAAGTAT TTTAGTTACA GTCCTGTAAT 600
TAAAGACACA TTTTAGATTG TATCTAAACT TAAATGTATC TAGAATACAT 650

ATATTTGAAT GCATCATATA CATGTATCCG ACACACCAAT TCTCATAAAA 700

AACGTAATAT CCTAAACTAA TTTATCCTTC AAGTCAACTT AAGCCCAATA 750

TACATTTTCA TCTCTAAAGG CCCAAGTGGC ACAAAATGTC AGGCCCAATT 800

ACGAAGAAAA GGGCTTGTAA AACCCTAATA AAGTGGCACT GGCAGAGCTT 850

ACACTCTCAT TCCATCAACA AAGAAACCCT AAAAGCCGCA GCGCCACTGA 900

TTTCTCTCCT CCAGGCGAAG ATG CAG ATC TTC GTG AAG ACC TTA 944
                      Met Gln Ile Phe Val Lys Thr Leu
                      1               5





ACG GGG AAG ACG ATC ACC CTA GAG GTT GAG TCT TCC GAC ACC 986
Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr
    10                  15                  20

ATC GAC AAT GTC AAA GCC AAG ATC CAG GAC AAG GAA GGG ATT 1028
Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile
        25                  30                  35

CCC CCA GAC CAG CAG CGT TTG ATT TTC GCC GGA AAG CAG CTT 1070
Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu
            40                  45                  50

GAG GAT GGT CGT ACT CTT GCC GAC TAC AAC ATC CAG AAG GAG 1112
Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu
                55                  60

TCA ACT CTC CAT CTC GTG CTC CGT CTC CGT GGT GGT 1148
Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly
65                  70                  75

GGA TCC GCT GTT AAA AGA GTG GGT CGT AGG TTG AAA AAG TTG 1190
Gly Ser Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu
            80                  85                  90

GAC CGT AAG ATT GAT AGG TTA GGA GTT GAT TTT TGATC 1228
Asp Arg Lys Ile Asp Arg Leu Gly Val Asp Phe
                95                  100
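
The amino acid assignments in SEQ ID NO: 92 can be checked by translating the coding region with the standard genetic code. The Python sketch below does so for the codons listed above, from ATG through the TGA stop. It is an illustrative check, not part of the original disclosure, and the names `CODON_TABLE`, `CODING_REGION`, and `translate` are introduced here for the example.

```python
# Illustrative sketch: translate the coding region of SEQ ID NO: 92 with the
# standard genetic code and confirm the ubiquitin / linker / peptide layout.
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Codons as listed for SEQ ID NO: 92, ending with the TGA stop codon.
CODING_REGION = """
ATG CAG ATC TTC GTG AAG ACC TTA
ACG GGG AAG ACG ATC ACC CTA GAG GTT GAG TCT TCC GAC ACC
ATC GAC AAT GTC AAA GCC AAG ATC CAG GAC AAG GAA GGG ATT
CCC CCA GAC CAG CAG CGT TTG ATT TTC GCC GGA AAG CAG CTT
GAG GAT GGT CGT ACT CTT GCC GAC TAC AAC ATC CAG AAG GAG
TCA ACT CTC CAT CTC GTG CTC CGT CTC CGT GGT GGT
GGA TCC GCT GTT AAA AGA GTG GGT CGT AGG TTG AAA AAG TTG
GAC CGT AAG ATT GAT AGG TTA GGA GTT GAT TTT TGA
"""


def translate(codons):
    """Translate whitespace-separated codons, stopping at the first stop."""
    protein = []
    for codon in codons.split():
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    protein = translate(CODING_REGION)
    print("ubiquitin:", protein[:76])
    print("linker:   ", protein[76:78])
    print("peptide:  ", protein[78:])
```

The translation gives 101 residues: a 76-residue ubiquitin ending in Gly-Gly, the Gly-Ser encoded at the BamHI site (GGA TCC), and a 23-residue peptide, in agreement with the numbering in the listing.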



(94) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1154 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE STRANDED

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: GENOMIC DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 
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CCAAAGCACA 


TACTTATCGA 


TTTAAATTTC 


AT CG AAG AG A 


TTAATATCGA 


50 


ATAATCATAT 


ACATACTTTA 


AATACATAAC 


AAATTTTAAA 


TACATATATC 


100 


TGGTATATAA 


TTAATTTTTT 

x * •* * x x x a x x 


AAAGTCATGA 


AGTATGTATC 


AAATACACAT 


150 


ATGGAAAAAA 


TTAACTATTC 


ATAATTTAAA 


AAATAGAAAA 


GATACATCTA 


200 


GTGAAATTAG 


GTGCATGTAT 


CAAATACATT 


AGGAAAAGGG 


CATATATCTT 


250 


GATCTAGATA 


ATTAACGATT 

^%x innv.vjni x 


TTGATTTATG 

x x vj xxx Jr* -L vj 


TATAATTTCC 

x x iw» x x a 


AAATGAAGGT 


3 00 




[TTPA/T!Zk AATA] [AfAATATAPT] TTTATCAGAA CATTCAACAA 350

AGCAACAACC AACTAGAGTG AAAAATACAC ATTGTTCTCT AGACATACAA 400

AATTGAGAAA [AGAATPTPAA] AATTTAGAGA AACAAATCTG [AATTTPTAG A] 450

[avjaaaaaaax] [A ATT ATGP A P] [XXX uL X XX X X w] CTCGAAAAAT AAATGAAAGA 500

[& ATT AGAPTT] TTTTAAAAGA TGTTAGACTA GATATACTCA AAAGCTATTA 550

AAGGAGTAAT [jv TT PTT PTT A] CATTAAGTAT TTTAGTTACA GTCCTGTAAT 600

[T AAAG APA P A] [TTTTAfJATTfi] TATCTAAACT TAAATGTATC TAGAATACAT 650


ATATTTGAAT GCATCATATA CATGTATCCG ACACACCAAT TCTCATAAAA 700

AACGTAATAT CCTAAACTAA TTTATCCTTC AAGTCAACTT AAGCCCAATA 750

TACATTTTCA TCTCTAAAGG CCCAAGTGGC ACAAAATGTC AGGCCCAATT 800

ACGAAGAAAA GGGCTTGTAA AACCCTAATA AAGTGGCACT GGCAGAGCTT 850

ACACTCTCAT TCCATCAACA AAGAAACCCT AAAAGCCGCA GCGCCACTGA 900

TTTCTCTCCT CCAGGCGAAG ATG CAG ATC TTC GTG AAG ACC TTA 944
Met Gln Ile Phe Val Lys Thr Leu
1 5

ACG GGG AAG ACG ATC ACC CTA GAG GTT GAG TCT TCC GAC ACC 986
Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr
10 15 20

ATC GAC AAT GTC AAA GCC AAG ATC CAG GAC AAG GAA GGG ATT 1028
Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile
25 30 35

CCC CCA GAC CAG CAG CGT TTG ATT TTC GCC GGA AAG CAG CTT 1070
Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu
40 45 50

GAG GAT GGT CGT ACT CTT GCC GAC TAC AAC ATC CAG AAG GAG 1112
Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu
55 60

TCA ACT CTC CAT CTC GTG CTC CGT CTC CGT GGT GGT GGA TCC 1154
Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gly Ser
65 70 75

(95) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94

Met Gln Ile Phe Val Lys Thr Leu
1 5

Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr
10 15 20

Ile Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile
25 30 35

Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu
40 45 50

Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu
55 60

Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly Gly Ser
65 70 75

(96) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2127 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE STRANDED

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: GENOMIC DNA AND OTHER DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95

TTTATCAATC AGATTTGAAC ATATAAATAA ATATAAATTG TCTCAATAAT 50

TCTACATTAA ACTAATATTT GAAATCTCAA TTTTATGATT TTTTAAATTC 100

ACTTTATATC CAAGACAATT TNCANCTTCA AAAAGTTTTA TTAAANATTT 150

ACATTAGTTT TGTTGATGAG GATGACAAGA TNTTGGTCAT CAATTACATA 200

TACCCAAATT GAATAGTAAG CAACTTCAAT GTTTTTCATA ATGATAATGA 250

CAGACACAAN NNAAACCCAT TTATTATTCA CATTGATTGA GTTTTATATG 300

CAATATAGTA ATAATAATAA TATTTCTTAT AAAGCAAGAG GTCAATTTTT 350

TTTTAATTAT ACCACGTCAC TAAATTATAT TTGATAATGT AAAACAATTC 400

AAATTTTACT TAAATATCAT GAAATAAACT ATTTTTATAA CCAAATTACT 450

AAATTTTTCC AATAAAAAAA AGTCATTAAG AAGACATAAA ATAAATTTGA 500

GGTAAANGAG TGAAGTCGAC TGACTTTTTT TTTTTTTATC ATAAGAAAAT 550

AAATTATTAA CTTTAACCTA ATAAAACACT AATATAATTT CATGGAATCT 600

AATACTTACC TCTTAGAAAT AAGAAAAAGT GTTTCTAATA GACCCTCAAT 650

TTACATTAAA TATTTTCAAT CAAATTTAAA TAACAAATAT CAATATGAGG 700

TCAATAACAA TATCAAAATA ATATGAAAAA AGAGCAATAC ATAATATAAG 750

GGACGATTTA AGTGCGATTA TCAAGGTAGT ATTATATCCT AATTTGCTAA 800

TATTTGNGCT CTTATATTTA AGGTCATGTT CATGATAAAC TTGAAATGCG 850

CTATATTAGA GCATATATTA AAATAAAAAA ATACCTAAAA TAAAATTAAG 900

TTATTTTTAG TATATATTTT TTTACATGAC CTACATTTTT CTGGGTTTTT 950

CTAAAGGAGC GTGTAAGTGT CGACCTCATT CTCCTAATTT TCCCCACCAC 1000

ATAAAAATTA AAAAGGAAAG GTAGCTTTTG CGTGTTGTTT TGGTACACTA 1050

CACCTCATTA TTACACGTGT CCTCATATAA TTGGTTAACC CTATGAGGCG 1100

GTTTCGTCTA GAGTCGGCCA TGCCATCTAT AAAATGAAGC TTTCTGCACC 1150

TCATTTTTTT CATCTTCTAT CTGATTTCTA TTATAATTTC TCTCAATTGC 1200

CTTCAAATTT CTCTTTAAGG TTAGAATCTT CTCTATTTTT 1240

GGTTTTTGTA TGTTTAGATT CTCGAATTAG CTAATCAGGC GCTGTTATAG 1290

CCCTTCCTTT TGAGTCTCTC CTCGGTTGTC TTGATGGAAA AGGCCTAACA 1340

TTTGAGTTTT TTTACGTCTG GTTTGATGGA AAAGGCCTAC AATTGGCCGT 1390

TTTCCCCGTT CGTTTTGATG AAAAAGCCCC TAGTTTGAGA TTTTTTTTCT 1440


GTCGTTCGTT CTAAAGGTTT AAAATTAGAG TTTTTACATT TGTTTGATGA 1490

AAAAGGCCTT AAATTTGAGT TTTTCCGGTT GATTTGATGA AAAAGCCCTA 1540

GAATTTGTGT TTTTCCGTCG GTTTGATTCT GAAGGCCTAA AATTTGAGTT 1590

TCTCCGGCTG TTTTGATGAA AAAGCCCTAA ATTTGAGTTT CTCCGGCTGT 1640

TTTGATGAAA AAGCCCTAAA TTTGAAGTTT TTTCCCCGTG TTTTAGATTG 1690

TTTAGGTTTT AATTCTCGAA TCAGCTAATC AGGGAGTGTG AAAGCCCTAA 1740

ATTGAAGTTT TTTTCGTTGT TCTGATTGTT GTTTTTATGA ATTTGCAG 1788

ATG CAG ATC TTT GTG AAA ACT CTC ACC GGA AAG ACT ATC ACC 1830
Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr
1 5 10

CTA GAG GTG GAA AGT TCT GAT ACA ATC GAC AAC GTT AAG GCT 1872
Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala
15 20 25

AAG ATC CAG GAT AAG GAA GGA ATT CCC CCG GAT CAG CAA AGG 1914
Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg
30 35 40

CTT ATC TTC GCC GGA AAG CAG TTG GAG GAC GGA CGT ACT CTA 1956
Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu
45 50 55

GCT GAT TAC AAC ATC CAG AAG GAG TCT ACC CTC CAT TTG GTG 1998
Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val
60 65 70

CTC CGT CTA CGT GGA GGT GGA TCC GCT GTT AAA AGA GTG GGT 2040
Leu Arg Leu Arg Gly Gly Gly Ser Ala Val Lys Arg Val Gly
75 80

CGT AGG TTG AAA AAG TTG GAC CGT AAG ATT GAT AGG TTA GGA 2082
Arg Arg Leu Lys Lys Leu Asp Arg Lys Ile Asp Arg Leu Gly
85 90 95

GTT GAT TTT TGATCTAGAG TCGACCGATC CCCCGAATTT CCCCGA 2127
Val Asp Phe
100
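
The reading frame of the fusion gene in SEQ ID NO: 95 can be checked mechanically. The following Python sketch is an illustration only and is not part of the sequence listing as filed. It assumes the standard genetic code and translates nucleotides 1789 to 2094 as transcribed above; the names `CODON_TABLE`, `CODING_REGION`, `translate` and `LYTIC_PEPTIDE` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Nucleotides 1789-2094 of SEQ ID NO: 95, copied from the listing.
CODING_REGION = (
    "ATG CAG ATC TTT GTG AAA ACT CTC ACC GGA AAG ACT ATC ACC "
    "CTA GAG GTG GAA AGT TCT GAT ACA ATC GAC AAC GTT AAG GCT "
    "AAG ATC CAG GAT AAG GAA GGA ATT CCC CCG GAT CAG CAA AGG "
    "CTT ATC TTC GCC GGA AAG CAG TTG GAG GAC GGA CGT ACT CTA "
    "GCT GAT TAC AAC ATC CAG AAG GAG TCT ACC CTC CAT TTG GTG "
    "CTC CGT CTA CGT GGA GGT GGA TCC GCT GTT AAA AGA GTG GGT "
    "CGT AGG TTG AAA AAG TTG GAC CGT AAG ATT GAT AGG TTA GGA "
    "GTT GAT TTT TGA"
).replace(" ", "")


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


protein = translate(CODING_REGION)

# SEQ ID NO: 98 written in one-letter code.
LYTIC_PEPTIDE = "AVKRVGRRLKKLDRKIDRLGVDF"

print(len(protein))                     # total residues before the stop codon
print(protein[:76])                     # ubiquitin portion
print(protein[76:78])                   # Gly-Ser linker encoded by GGA TCC
print(protein[78:] == LYTIC_PEPTIDE)    # C-terminal lytic peptide check
```

Under these assumptions the open reading frame yields 101 residues: 76 residues of ubiquitin, the Gly-Ser pair encoded by GGA TCC, and the 23 residues of SEQ ID NO: 98. The codon CAG at residue 31 translates to glutamine.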



(97) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2022 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE STRANDED

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: GENOMIC DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96

TTTATCAATC AGATTTGAAC ATATAAATAA ATATAAATTG TCTCAATAAT 50

TCTACATTAA ACTAATATTT GAAATCTCAA TTTTATGATT TTTTAAATTC 100

ACTTTATATC CAAGACAATT TNCANCTTCA AAAAGTTTTA TTAAANATTT 150

ACATTAGTTT TGTTGATGAG GATGACAAGA TNTTGGTCAT CAATTACATA 200

TACCCAAATT GAATAGTAAG CAACTTCAAT GTTTTTCATA ATGATAATGA 250

CAGACACAAN NNAAACCCAT TTATTATTCA CATTGATTGA GTTTTATATG 300

CAATATAGTA ATAATAATAA TATTTCTTAT AAAGCAAGAG GTCAATTTTT 350

TTTTAATTAT ACCACGTCAC TAAATTATAT TTGATAATGT AAAACAATTC 400

AAATTTTACT TAAATATCAT GAAATAAACT ATTTTTATAA CCAAATTACT 450

AAATTTTTCC AATAAAAAAA AGTCATTAAG AAGACATAAA ATAAATTTGA 500

GGTAAANGAG TGAAGTCGAC TGACTTTTTT TTTTTTTATC ATAAGAAAAT 550

AAATTATTAA CTTTAACCTA ATAAAACACT AATATAATTT CATGGAATCT 600

AATACTTACC TCTTAGAAAT AAGAAAAAGT GTTTCTAATA GACCCTCAAT 650

TTACATTAAA TATTTTCAAT CAAATTTAAA TAACAAATAT CAATATGAGG 700

TCAATAACAA TATCAAAATA ATATGAAAAA AGAGCAATAC ATAATATAAG 750

GGACGATTTA AGTGCGATTA TCAAGGTAGT ATTATATCCT AATTTGCTAA 800

TATTTGNGCT CTTATATTTA AGGTCATGTT CATGATAAAC TTGAAATGCG 850

CTATATTAGA GCATATATTA AAATAAAAAA ATACCTAAAA TAAAATTAAG 900

TTATTTTTAG TATATATTTT TTTACATGAC CTACATTTTT CTGGGTTTTT 950

CTAAAGGAGC GTGTAAGTGT CGACCTCATT CTCCTAATTT TCCCCACCAC 1000

ATAAAAATTA AAAAGGAAAG GTAGCTTTTG CGTGTTGTTT TGGTACACTA 1050

CACCTCATTA TTACACGTGT CCTCATATAA TTGGTTAACC CTATGAGGCG 1100

GTTTCGTCTA GAGTCGGCCA TGCCATCTAT AAAATGAAGC TTTCTGCACC 1150

TCATTTTTTT CATCTTCTAT CTGATTTCTA TTATAATTTC TCTCAATTGC 1200

CTTCAAATTT CTCTTTAAGG TTAGAATCTT CTCTATTTTT 1240

GGTTTTTGTA TGTTTAGATT CTCGAATTAG CTAATCAGGC GCTGTTATAG 1290

CCCTTCCTTT TGAGTCTCTC CTCGGTTGTC TTGATGGAAA AGGCCTAACA 1340

TTTGAGTTTT TTTACGTCTG GTTTGATGGA AAAGGCCTAC AATTGGCCGT 1390


TTTCCCCGTT CGTTTTGATG AAAAAGCCCC TAGTTTGAGA TTTTTTTTCT 1440


GTCGTTCGTT CTAAAGGTTT AAAATTAGAG TTTTTACATT TGTTTGATGA 1490

AAAAGGCCTT AAATTTGAGT TTTTCCGGTT GATTTGATGA AAAAGCCCTA 1540

GAATTTGTGT TTTTCCGTCG GTTTGATTCT GAAGGCCTAA AATTTGAGTT 1590

TCTCCGGCTG TTTTGATGAA AAAGCCCTAA ATTTGAGTTT CTCCGGCTGT 1640

TTTGATGAAA AAGCCCTAAA TTTGAAGTTT TTTCCCCGTG TTTTAGATTG 1690

TTTAGGTTTT AATTCTCGAA TCAGCTAATC AGGGAGTGTG AAAGCCCTAA 1740

ATTGAAGTTT TTTTCGTTGT TCTGATTGTT GTTTTTATGA ATTTGCAG 1788

ATG CAG ATC TTT GTG AAA ACT CTC ACC GGA AAG ACT ATC ACC 1830
Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr
1 5 10

CTA GAG GTG GAA AGT TCT GAT ACA ATC GAC AAC GTT AAG GCT 1872
Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala
15 20 25

AAG ATC CAG GAT AAG GAA GGA ATT CCC CCG GAT CAG CAA AGG 1914
Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg
30 35 40

CTT ATC TTC GCC GGA AAG CAG TTG GAG GAC GGA CGT ACT CTA 1956
Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu
45 50 55

GCT GAT TAC AAC ATC CAG AAG GAG TCT ACC CTC CAT TTG GTG 1998
Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val
60 65 70

CTC CGT CTA CGT GGA GGT GGA TCC 2022
Leu Arg Leu Arg Gly Gly Gly Ser
75

(98) INFORMATION FOR SEQ ID NO: 97:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 

Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu 
1 5 10 15 

Ala Arg Lys Ile Ala Arg Leu Gly Val Ala Lys Leu Ala Gly Leu Arg
20 25 30 

Ala Val Leu Lys Phe 
35 

(99) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98

Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Asp Arg Lys Ile
1 5 10 15 

Asp Arg Leu Gly Val Asp Phe 
20 
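
The declared LENGTH of each peptide entry can be compared against the residues actually listed. The following Python sketch is an illustration only and is not part of the listing as filed. It converts the three-letter sequences of SEQ ID NOs: 97 and 98 to one-letter code and counts the residues; `THREE_TO_ONE`, `to_one_letter` and `DECLARED_LENGTHS` are hypothetical names introduced for this example, and the sequences are typed with the standard abbreviation Ile.

```python
# Standard three-letter to one-letter amino acid abbreviations.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert a space-separated three-letter sequence to one-letter code.

    An unrecognised token (for example a misread abbreviation) raises
    KeyError, which makes transcription errors easy to spot.
    """
    return "".join(THREE_TO_ONE[residue] for residue in three_letter.split())


SEQ_ID_97 = (
    "Lys Arg Lys Arg Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu "
    "Ala Arg Lys Ile Ala Arg Leu Gly Val Ala Lys Leu Ala Gly Leu Arg "
    "Ala Val Leu Lys Phe"
)
SEQ_ID_98 = (
    "Ala Val Lys Arg Val Gly Arg Arg Leu Lys Lys Leu Asp Arg Lys Ile "
    "Asp Arg Leu Gly Val Asp Phe"
)

# LENGTH values as declared in the two entries.
DECLARED_LENGTHS = {"SEQ ID NO: 97": 37, "SEQ ID NO: 98": 23}

for name, sequence in (("SEQ ID NO: 97", SEQ_ID_97), ("SEQ ID NO: 98", SEQ_ID_98)):
    one_letter = to_one_letter(sequence)
    matches = len(one_letter) == DECLARED_LENGTHS[name]
    print(name, one_letter, len(one_letter), matches)
```

The same check can be applied to any other peptide entry by pasting its three-letter sequence and declared LENGTH.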

Claims

What is claimed is:

1. A lytic peptide comprising a peptide having an
amino acid sequence selected from SEQ ID NOs. 39 to 91
and 97 to 98.

2. A recombinant DNA molecule comprising a
molecule having a nucleotide sequence encoding a lytic
peptide described by an amino acid sequence selected
from SEQ ID NOs. 39 to 91 and 97 to 98.

3. A method of developing disease resistant
plants comprising expressing the recombinant DNA
molecule of claim 2 in a plant cell.

4. A method of developing disease resistant
plants comprising expressing the lytic peptide of claim
1 in a plant cell.

(56) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55

Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe Ala Lys Lys Phe
1 5 10 15

Ala Lys Phe Ala Phe Ala Phe Lys Lys Lys Lys 
20 25 

(57) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56

Lys Lys Lys Lys Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe
1 5 10 15

Ala Lys Lys Phe Ala Lys Phe Ala Phe Ala Phe
20 25

(58) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57

Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe Val Arg Lys Phe
1 5 10 15

Ile Arg Phe Ala Phe Leu Phe
20

(59) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58

Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe Val Arg Lys Phe
1 5 10 15

Ile Arg Phe Ala Phe Leu Phe Lys Arg Lys Arg
20 25 

(60) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59

Lys Arg Lys Arg Phe Ala Arg Lys Phe Leu Lys Arg Phe Lys Lys Phe
1 5 10 15

Val Arg Lys Phe Ile Arg Phe Ala Phe Leu Phe
20 25

(61) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60

Ile Ala Lys Lys Ile Ala Lys Lys Ile Lys Lys Ile Ala Lys Lys Ile
1 5 10 15

Ala Lys Ile Ala Ile Ala Ile
20 

(62) INFORMATION FOR SEQ ID NO: 61:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61

Ile Ala Lys Lys Ile Ala Lys Lys Ile Lys Lys Ile Ala Lys Lys Ile
1 5 10 15

Ala Lys Ile Ala Ile Ala Ile Lys Lys Lys Lys
20 25

(63) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62

Lys Lys Lys Lys Ile Ala Lys Lys Ile Ala Lys Lys Ile Lys Lys Ile
1 5 10 15

Ala Lys Lys Ile Ala Lys Ile Ala Ile Ala Ile
20 25 

(64) INFORMATION FOR SEQ ID NO: 63:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63

Ile Ala Arg Lys Ile Leu Lys Arg Ile Lys Lys Ile Val Arg Lys Phe
1 5 10 15

Ile Arg Ile Ala Ile Leu Ile
20 

(65) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 

Ile Ala Arg Lys Ile Leu Lys Arg Ile Lys Lys Ile Val Arg Lys Phe
1 5 10 15

Ile Arg Ile Ala Ile Leu Ile Lys Arg Lys Arg
20 25 






WO 96/03522 



PCT/US95/09338 



(66) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65

Lys Arg Lys Arg Ile Ala Arg Lys Ile Leu Lys Arg Ile Lys Lys Ile
1 5 10 15

Val Arg Lys Phe Ile Arg Ile Ala Ile Leu Ile
20 25 

(67) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys
1 5 10 15

Leu 

(68) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg
1 5 10 15

Ala Lys Ile Lys Leu
20 

(69) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys
1 5 10 15

Leu Lys Arg Lys Arg 
20 

(70) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys
1 5 10 15

Leu Arg Val Lys Leu Lys Ile
20

(71) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys
1 5 10 15

Leu Arg Val Lys Leu Lys Ile Lys Arg Lys Arg
20 25

(72) INFORMATION FOR SEQ ID NO: 71:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg
1 5 10 15

Ala Lys Ile Lys Leu Arg Val Lys Leu Lys Ile
20 25

(73) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys
1 5 10 15

Leu Arg Val Lys Leu Lys Ile Arg Ala Arg Ile Lys Leu
20 25

(74) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys
1 5 10 15

Leu Arg Val Lys Leu Lys Ile Arg Ala Arg Ile Lys Leu Lys Arg Lys
20 25 30

Arg

(75) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg
1 5 10 15

Ala Lys Ile Lys Leu Arg Val Lys Leu Lys Ile Arg Ala Arg Ile Lys
20 25 30 

Leu 

(76) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys
1 5 10 15

Leu Val Phe Ala Ile Leu Leu
20 

(77) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76

Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg Ala Lys Ile Lys
1 5 10 15

Leu Val Phe Ala Ile Leu Leu Lys Arg Lys Arg
20 25 

(78) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77

Lys Arg Lys Arg Phe Lys Leu Arg Ala Lys Ile Lys Val Arg Leu Arg
1 5 10 15

Ala Lys Ile Lys Leu Val Phe Ala Ile Leu Leu
20 25

(79) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

Val Phe Ala Ile Leu Leu Phe Lys Leu Arg Ala Lys Ile Lys Val Arg
1 5 10 15

Leu Arg Ala Lys Ile Lys Leu
20 

(80) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79

Val Phe Ala Ile Leu Leu Phe Lys Leu Arg Ala Lys Ile Lys Val Arg
1 5 10 15

Leu Arg Ala Lys Ile Lys Leu Lys Arg Lys Arg
20 25

(81) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 

Lys Arg Lys Arg Val Phe Ala Ile Leu Leu Phe Lys Leu Arg Ala Lys
1 5 10 15

Ile Lys Val Arg Leu Arg Ala Lys Ile Lys Leu
20 25

(82) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 

Val Gly Glu Cys Val Arg Gly Arg Cys Pro Ser Gly Met Cys Cys Ser 
1 5 10 15

Gln Phe Gly Tyr Cys Gly Lys Gly Pro Lys Tyr Cys Gly
20 25 

(83) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82

Val Gly Glu Cys Val Arg Gly Arg Cys Pro Ser Gly Met Cys Cys Ser
1 5 10 15

Gln Phe Gly Tyr Cys Gly Lys Gly Pro Lys Tyr Cys Gly Arg
20 25 30 

(84) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83

Leu Gly Asp Cys Leu Lys Gly Lys Cys Pro Ser Gly Met Cys Cys Ser
1 5 10 15

Asn Tyr Gly Phe Cys Gly Arg Gly Pro Arg Phe Cys Gly Lys
20 25 30



(85) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84

Gln Cys Ile Gly Asn Gly Gly Arg Cys Asn Glu Asn Val Gly Pro Pro
1 5 10 15

Tyr Cys Cys Ser Gly Phe Cys Leu Arg Gln Pro Gly Gln Gly Tyr Gly
20 25 30

Tyr Cys Lys Asn Arg
35

(86) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36

(B) TYPE: AMINO ACID

(C) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85

Cys Ile Gly Asn Gly Gly Arg Cys Asn Glu Asn Val Gly Pro Pro Tyr
1 5 10 15

Cys Cys Ser Gly Phe Cys Leu Arg Gln Pro Asn Gln Gly Tyr Gly Val
20 25 30

Cys Arg Asn Arg
35

(87) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 

Cys Ile Gly Gln Gly Gly Lys Cys Gln Asp Gln Leu Gly Pro Pro Phe
1 5 10 15

Cys Cys Ser Gly Tyr Cys Val Lys Asn Pro Gln Asn Gly Phe Gly Leu
20 25 30

Cys Lys Gln Lys
35 

(18) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(X) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1 5 10 15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Lys
20 25 30 

Val Ala Val Ala Val 
35 

(19) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val Ala Lys Lys Val 

1 5 10 15

Ala Lys Val Ala Val Ala Val 
20

(20) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val Ala Lys Lys Val
1 5 10 15 

Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Val 
20 25 

(21) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val Ala Lys Lys Val 
1 5 10 15 

Ala Lys Val Ala Val Ala Lys Val Ala Val Ala Lys Val Ala Val Ala 
20 25 30 

Val

(22) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 
1 5 10 15

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val 
20 25 

(23) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 
1 5 10 15

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 
20 25 30

(24) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

Lys Lys Lys Lys Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val 
1 5 10 15

Ala Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 
20 25 30 

Lys Val Ala Lys Lys 
35 

(25) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1 5 10 15

Lys Lys Val Ala Lys Lys Val 
20

(26) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1 5 10 15

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala 
20 25 

(27) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1 5 10 15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Val Ala Lys 
20 25 30

(28) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1 5 10 15

Lys Lys Val Ala Lys Lys Val Lys Lys Lys Lys 
20 25

(29) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1 5 10 15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Lys Lys Lys 
20 25 30

(30) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

Phe Val Lys Lys Val Ala Lys Val Ala Lys Lys Val Ala Lys Val Ala 
1 5 10 15 

Lys Lys Val Ala Lys Lys Val Ala Lys Lys Val Ala Lys Val Ala Lys 
20 25 30 

Lys Lys Lys Lys Lys 
35 

(31) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 



Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Lys Lys Lys Lys 
1 5 10 15

(32) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 15 

Ala Lys Lys Lys Lys 
20 

(33) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 15

Ala Lys Val Lys Ala Lys Val Lys Lys Lys Lys 
20 25

(34) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 

(35) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 15

Ala

(36) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 15

Ala Lys Val Lys Ala Lys Val 
20 

(37) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 15

(38) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 15

Ala Lys Val Lys Ala 
20 

(39) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38

Lys Lys Lys Lys Phe Lys Val Lys Ala Lys Val Lys Ala Lys Val Lys 
1 5 10 15 

Ala Lys Val Lys Ala Lys Val Lys Ala Lys Val 
20 25

(40) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

Phe Lys Lys Val Lys Lys Val Ala Lys Lys Val Cys Lys Cys Val Lys 
1 5 10 15

Lys Ala Val Lys Lys Val Lys Lys Phe 
20 25 

(41) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
1 5 10 15 

Lys Ala Val Lys Lys Ala Val Cys Cys Cys Cys 
20 25

(42) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 

Cys Cys Cys Cys Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1 5 10 15

Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
20 25 

(43) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42

Phe Ala Val Ala Val Lys Ala Val Lys Lys Ala Val Lys Lys Val Lys 
1 5 10 15

Lys Ala Val Lys Lys Ala Val Ser Ser Ser Ser
20 25

(44) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27

(B) TYPE: AMINO ACID
(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

Ser Ser Ser Ser Phe Val Lys Lys Val Ala Lys Lys Val Lys Lys Val 
1 5 10 15 

Ala Lys Lys Val Ala Lys Val Ala Val Ala Val 
20 25 

(45) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 
(vi) ORIGINAL SOURCE: SYNTHETIC
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 

Phe Ala Leu Ala Leu Lys Ala Leu Lys Lys Ala Leu Lys Lys Leu Lys 
1 5 10 15

Lys Ala Leu Lys Lys Ala Leu 
20

(46) INFORMATION FOR SEQ ID NO: 45:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 

Leu Ala Lys Lys Leu Ala Lys Lys Leu Lys Lys Leu Ala Lys Lys Leu 
1 5 10 15

Ala Lys Leu Ala Leu Ala Phe 
20 

(47) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 

Phe Ala Phe Ala Phe Lys Ala Phe Lys Lys Ala Phe Lys Lys Phe Lys 
1 5 10 15 

Lys Ala Phe Lys Lys Ala Phe 
20

(48) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 
(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE 
(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

Phe Ala Ile Ala Ile Lys Ala Ile Lys Lys Ala Ile Lys Lys Ile Lys
1 5 10 15

Lys Ala Ile Lys Lys Ala Ile
20 

(49) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 
(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 
(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 

Phe Ala Lys Lys Phe Ala Lys Lys Phe Lys Lys Phe Ala Lys Lys Phe 
1 5 10 15

Ala Lys Phe Ala Phe Ala Phe 
20

(52) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 
(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51

Phe Ala Arg Ala Arg Lys Ala Arg Lys Lys Ala Arg Lys Lys Arg Lys 
1 5 10 15

Lys Ala Arg Lys Lys Ala Arg Lys Asp Arg 
20 25 

(53) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE:

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: COMPLETE PEPTIDE

(vi) ORIGINAL SOURCE: SYNTHETIC
(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

Phe Ala Val Ala Val Cys Ala Val Cys Cys Ala Val Cys Cys Val Cys 
1 5 10 15

Cys Ala Val Cys Cys Ala Val 
20

(54) INFORMATION FOR SEQ ID NO: 53:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 

Phe Ala Val Ala Val Ser Ala Val Ser Ser Ala Val Ser Ser Val Ser 
1 5 10 15

Ser Ala Val Ser Ser Ala Val 
20 

(55) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PEPTIDE 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: COMPLETE PEPTIDE 

(vi) ORIGINAL SOURCE: SYNTHETIC 

(vii) IMMEDIATE SOURCE: SYNTHETIC 

(x) PUBLICATION INFORMATION: NOT PREVIOUSLY PUBLISHED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 

Phe Ala Val Ala Val Ser Ala Val Ser Ser Ala Val Ser Ser Val Ser 
1 5 10 15

Ser Ala Val Ser Ser Ala Val Ser Ser Ser Ser 
20 25